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Chapter  I 


1991  AFOSR/RL  Workshop  on  Intelligent  Information  Systems  (IIS) 

The  IIS  workshop  brought  together  researchers  in  the  field  of 
Artificial  Intelligence  and  Data  Bases  from  a  number  of  various 
sources.  Represented  were  various  government  agencies  that 
included  Air  Force,  Army,  Navy,  Darpa,  and  NASA,  several  university 
research  groups  were  represented,  and  finally  government 
contractors  were  represented. 

The  workshop  agenda  included  a  tightly  integrated  set  of  tutorial 
presentations,  individual  research  thrusts  in  the  IIS  area,  and  panel 
discussions.  The  AFOSR  research  program  in  the  area  was 
represented  by  several  by  a  series  of  talks  that  concentrated  on  an 
integrated  research  program  in  robust,  cooperative,  and  uncertain 
knowledge-bases.  Rome  Labs  cooperative  program  with  Darpa  in 
knowledge-base  planning  and  scheduling  was  presented  to  the 
research  group  assembled  and  provided  an  application  scenario 
where  many  of  the  research  thrusts  could  applied. 

Those  attended  agreed  that  the  workshop  was  extremely  helpful  in 
understanding  Air  Force  problems  in  the  information  area.  The 
details  of  the  Darpa/RL  Planning  and  Scheduling  Initiative  defined 
specific  data/knowledge  problems  in  the  large  application  areas  and 
focused  on  possible  research  areas. 

A  lively  two  days  of  interchange  was  culminated  by  a  panel 
summarizing  the  workshop  and  recommending  future  directions. 

The  conclusions  reached  included: 

unanimous  support  that  the  workshop  should  be  held  again. 

need  exists  to  establish  a  consortium  aimed  at  developing  a 
rich  knowledge  base  to  be  made  available  for  community  test 
and  evaluation. 

ways  to  apply  technology  of  Artificial  Intelligence  and 
intelligent  data  bases  must  be  explored  and  expanded. 

Possible  areas  of  research  include: 


ii 


mediating  databases 
cooperating  databases 

approximate,  incomplete,  uncertain  computation 

buggy  databases 

reasoning  about  information 

performance,  storing  information  issues 

new  information  language 

high  performance  AI/DB  paradigms 

general  purpose  problem  solvers 

Other  notions  regarding  the  workshop  included: 

A  panel  should  immediately  immediately  follow  each  topical 
area  to  maximize  discussion  of  the  area  and  gain  specitic 
recommendations. 

Expand  list  of  invites. 
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AGEMDA 


final  program 

INTELLIGENT  INFORMATION  SYSTEM  WORKSHOP 


The  following  Is  the  program  for  the  Workshop 

on  Intelligent  Information  Systems,  jointly  sponsored  by  AFOSR 

and  Rome  Laboratory.  The  workshop  will  be  held  at  Rome  Labs,  Building  3 

C3  Conference  Room  1,  Griff iss  Air  Force  Base,  New  York, 

on  October  22,23  1991. 


Tuesday  October  22,  1991 

7:15  -  8:00  Registration*:  Front  Lobby  Building  3,  Griff iss  AFB,  NY 

8:00  -  8:15  Welcome  (AFOSR/RL)  ~  Dr.  Fred  Diamond  -  Rome  Labs 

8:15  -  9:45  “Two  Decades  of  Knowledge-Based  Systems:  A  Task-Oriented 

Overview",  b.  Chandrasekaran ,  Ohio  State  University 

9:45  -  10:00  Break 

10:00  -  11:30  "Timing  Issues  in  Real-Time  Information  Systems", 

Jane  W.S.  Liu,  University  of  Illinois 

11:30  -  12:45  Working  Lunch:  "Overview  of  DARPA/RL  Knowledge-Based 
Planning  and  Scheduling  Initiative" 

Donald  Roberts,  Rome  Laboratory 

12:45  -  2:15  "Cooperative  and  Informative  Answers  for  Deductive 
Databases",  Jack  Minker,  University  of  Maryland 

2:15  -  2:30  Break 

2:30  -  4:00  "Probabilistic  Models  of  Retrieval", 

Bruce  Croft,  University  of  Massachusetts 

4:00  -  4:15  Break 

4:15  -  4:45  "General  Approach  to  Schema  Merging" 

Peter  Buneman,  University  of  Pennsylvania 

6:00  -  7:00  Social  Hour  -  Reception 

Quality  Inn  Red  Coach  Restaurant 

Comor  S.  James  Street  and  Brie  Blvd. 

Rome,  New  York 

Wednesday  Oct  23 

8:00  -  8:30  "Intelligent  Databases  for  Planning  and  Scheduling" 

Tim  Flnin,  Unisys  Corporation 

8:30  -  9:00  "Extracting  Rules  from  Software  for  Knowledge  Bases" 

Noah  Prywes,  U.  of  Pennsylvania 

9:00  -  9:30  "Heterogenous  Knowledge  Bases", 

Gio  Wiederhold  -  Darpa 


9:30  -  9:45 

Break 

9:45  -  10:15 

"Services  and  Information  Management  for  Decision  Systems 
(SIMS)"  Yigal  Arens,  University  of  Southern  California 

10:15  -  10:45 

"Ariel  Database  Rule  System” 

Eric  Hanson,  WL/Wright  State  Univ. 

10:45  -  11:13 

"Knowledge  Organization" 

B.  Chandrasekaran,  Ohio  State  Univ., 

11:15  -  11:45 

"Intelligent  Data  Bases" 

Jack  Minker,  Univ.  of  Maryland 

11:45  -  1:00 

Working  Lunch  "COBASE:  Cooperative  Distributed  Database" 
Wesley  Chu,  UCLA 

1:00-  1:30 

"Retrieval  with  Partial  Information" 

Bruce  Croft,  Massachusetts  Univ. 

1:30  -  2:00 

"Monotone  Databases” 

Jane  Liu,  Univ.  of  Illinois 

2:00  -  2:30 

"Representing  and  Reasoning  About  Temporal  Information" 

Mark  Boddy,  Honeywell  Infomation  Systems 

2:30  -  3:00 

"Highly  Programmable  Architectures  for  Knowledge  Base 
and  Natural  Language  Processing",  Christine  Montgomery, 
Language  Systems  Inc.,  Jean-Luc  Gaudiot,  Univ.  of 

Southern  California 

3:00  -  3:30 

"  Data  Base  Reasoning  in  Engineering  Design" 

Katia  P.  Sycara,  Carnegie  Mellon  University 

3:30  3:45 

Break 

3:45  -  4:15 

Panel  Discussion  "Future  Directions  of  Intelligent 
Information  Systems” 

(All  Participants  Provide  2-4  Minute  Position  Statement) 

4:15  -  4:45 

Wrapup 

•  There  will  be  a  minimal  charge  for  coffee  and  refreshments  -  S3. 00 


Also,  there  will  be  a  S5.00  charge,  for  hors  d'oeuvres,  for  those 
attending  social  hour  with  pay  as  you  go  bar. 


Lodging  Information: 

Beeches-Paul  Revere  Lodge 
Route  26 
Rome,  New  YorJc 
315-337-1775 


Quality  INN 

Erie  Blvd.  6  S.  James  Street 
Rome ,  New  Yorlt 
315-336-4300 


Consort  Horizon  Airport  Inn 
Oneida  County  Airport 
Orlskany,  New  York 
315-736-3377 


Radlsson  Hotel  Utica  Center 
Genesee  Street 
Utica,  New  York 
315-797-8010 
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APPENDIX  A 


Knowledge-Based  Systems: 
A  Task-Oriented  Overview 


B.  ChandraMkaran 
Laboratory  for  Al  Raaaarch 
Tha  Ohio  Stata  Univarahy 
Columbua,  OH  43210 


Email:  chandra@cia.ohk>>8tata^u 


Laboratory  tor  Al  Resaarcb,  The  Ohio  Slate  Univeraity 


Goals  for  the  'talk 

1 .  TYPES  OF  INFORMATION  PROCESSING  PROBLEMS 
AND  HOW  APPROPRIATE  Al  IS  FOR  THBR  SOLUTIONS 

2.  TYPES  OF  Al  ARCHITECTURES  THAT  ARE  UNDER  DISCUSSION  IN  Al 

3.  TASKS,  METHODS  AND  KNOWLEDGE 

4.  ARCHITECTURES  FOR  KNOWLEDGE-BASED  SYSTEMS 


Laboratory  tor  At  Research,  The  Ohio  Stata  Univeraity 
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INFORMATION  PROCESSING  PROB1.EM  I'YPES  AND  Al 


_  I 

1.  TYPES  OF  ^FORMATION  PROCESSING  PROBLEMS 

AND  HOW  APPROPRIATE  Al  IS  FOR  THEIR  SOLUTIONS _ 

2.  TYPES  OF  Al  ARCHITECTURES  THAT  ARE  UNDER  DISCUSSION  IN  Al 
3  TASKS.  METHODS  AND  KNOWLEDGE 
4.  ARCHITECTURES  FOR  KNOWLEDGE-BASED  SYSTEMS 


lAbocatocy  tor  Al  Ressareh,  The  Oho  Stale  Univeriity 


Types  of  IP  Problems  and  Apprctpriateness  of  Al 


Information  Processing  Task  and  Computer  Programs 
Task  Types: 

Typel 

•Ctosed  From  Algorithms  Exist 

•Information  Needed  for  Running  the  AtgorKhms  Available  in 
the  Domain 

•Time  and  Space  Requirements  are  Tractable 
E.G.: 

FEM,  Multiplication  Routines,  Sorting  Programs 

Laboraioty  tor  At  ntitarch.  The  Ohto  8Me  IMveraHy 
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Type  2  Problems:  Approriate  for  Al  Technicjues 


Type  2 

*  Algorithms  of  Type  1  are  not  Applicable 

Because,  e.g..  They  are  not  Knowm, 
Information  Needed  not  AvaiisriMe, 
or  the  Aigorithms  are  not  tractable. 
(Takes  Too  Long,  e.g.) 


*  But  in  the  Domains  of  interests.  There  Exist  Human 

Problem  Severs  (Experts)  who  Routinely  Solve  Versions 
of  the  Problem  Well  Enough 

*  Qualitative,  in  General 

*  Humans  May  Do  it  by  a  Recogn^bn  Process  or  a 
Delberative  Process 

*  e.g.:  Diagnosis,  Planning.  Design  in  specific 
domains 


Laboratory  for  Al  Rasoarch,  Tha  Ohio  Staia  Univarsity 


Ty  pe  3  Problems 


Types 

•Not  Types  1  or  2.  but  a  Solution  May  be 
Possftrle  with  Al  Methods  of  Search. 

(Heuristic  Search  in  Large  Spaces.) 

—  Traveling  Salesman  Problem,  Search 
for  Patterns  in  Large  Databases 


Will  Concentrate  in  this  Tak  on  Type  2  Tasks  as 
Appropriate  for  Expert  System  Approach 


Laboratory  for  AIRataareh,  Tha  Ohto  Stait  Univanity 


DEL18ERA  riVE  VS  RECOGNITION  ARCI I JTECI'liRES 


1.  TYPES  OF  INFORMATION  PROCESSING  PROBLEMS 
AND  HOW  APPROPRIATE  Al  IS  FOR  THBR  SOLUTIONS 

2.  TYPES  OF  Al  ARCHITECTURES  THAT  ARE  UNDER  DISCUSSION  IN  Al 

3.  TASKS.  METHODS  AND  KNOWLEDci 

4.  ARCHITECTURES  FOR  KNOWLEDGE-BASED  SYSTEMS 


LaboraRKyforAIRMMfCh,  Th*  Ohio  Stai*  UnIvarsity 


MeT3Pbor5?  tVir  ProWern  Solving  in  Al 


Connectionist  nets 


Reasoning 

-  Knowledge 

-  Rules  of  Inference 

Problem  Solving 

-  initial  state 

-  Goal  state 

-  Operators 

-  Search 


-  Networks 

-  Frames 
-Cases 


Laboratory  for  Al  nonorch,  Tho  Ohio  Stafo  Unhraraiiy 
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Metaphors  for  Jh-oblein  Solving  in  Ai:  Contd 


Deliberation :  Explicit  operation  of  manipulaiing  knowledge 
to  produce  new  states  of  knowledge 

*  Logic  --  Operation  on  predicates  and  clauses  by  means 

of  Inference  Rules  (normally  soundness  is  a 
major  concern). 

*  Problemsolving  as  search  in  a  State  Space ,  where  each 

state  corresponds  to  a  knowledge  state,  and 
operators  change  states,  goal  is  to  achieve 
kimwledge  corresponding  to  goal  state. 

Duality  between  the  logic  view  and  the  seardi  view 

--  Need  for  search  in  the  logk  view  as  well  -  many  inferences 
can  be  made,  few  of  which  lead  to  the  goal 
~  State  expansion  operators  in  the  state  space  view  can  be  deemed 
to  be  a  l^d  of  inference  rule. 


Laboratory  for  AI  Researeb,  The  Ohio  State  Univeraity 


Mycin  and  R1  as  Search  Systems 

All  knowledge  systems  have  to  have  knowledge  to 

*  Set  up  alternatives  (Elements  of  the  initiial  state) 

*  Generate  new  {voblem  states  (inference  rules  in  logic,  operators  in  GPS) 

*  Control  Search. 

The  Diagnostic  Part  of  Mycin: 

Initial  State  —  Observations,  but  no  knowledge  of  causative  organism 

Goal  state  ~  Knowledge  of  causative  organism 

Problem  space:  The  hierarchy  of  organisms 

Operators  for  Statte  Expansion :  "Subclass  of  " 

Seardi  control  knowledge:  heuristics  about  what  to  consider,  when  to 
prune  space  of  hypotheses,  etc. 


Laborafory  for  AI  naaeareh,  'The  Ohio  Siaia  Unhreralty 
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R1  jis  Problera  Space  Searcher 


R1  as  a  Configurar: 

Initial  state ;  A  list  of  components,  some  connceted 
-  Goal  state  :  All  components  connected  appropriately 
Operators:  Various  oonnectnn  (praters 
Search  control  knowledge :  Local  configuration  tests 


Laboratory  for  Al  Reseweh,  The  Ohto  Stala  Univeraity 


Mon-deliberative  phenomena 


Memory  :  "One-shot"  phenomena:  bdiavior  of  interest  is  produced 
as  "one  mental  cycle"  in  humans 

"  matching,  retrievaL  classification,  recognition  common  task-level 
descriptions 

—  Case-based  reasoning,  oonnectionist  (neural)  networks 
—  Symbolic  versus  "continuous"  mathematics  rqnesentations 


Laboratory  for  Al  Research,  The  C3hto  State  Univoraity 
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A  connectionist  Network 


Connectionijit  Nouvorks  “•  Continued 


This  style  of  network  can  be  seen  as  doing  evidence  combination  at  multiple 
levels  (hidden  units  correspond  to  intermediate  abstractions) 


The  input  data  can  be  observations  of  a  mechanical  system,  the 
hidden  units  can  be  intermediate  hypothesis ,  and  mdfiinctioons  can  be 
decision  units. 

Quite  independent  of  the  connectionist  nature  of  the  ardntecture,  we  can 
say  that  the  system  is  combining  evidence  firom  groups  of  observations 
to  form  intermediate  hypotheses,  which  in  turn  are  combined  to 
evaluate  evidence  for  various  malfunction  hypotheses. 

The  Cormecdonist  system  then  does  diagnoris  by  matching  and  evidence 
abstraction. 


Laboratory  tor  Al  Ressareh,  lha  Ohto  Swa  Univaralty 


Case-Bas?d  Rei^  soiling 


Problemsolving  as  Retrieving  a  "similar”  problem's  solution  from  memory 

and  "tweaking"  it. 

Some  sort  of  associative  matching,  based  on  indexes  to  the  cases  in  memory 
Tweaking  still  involves  deliberative  search-like  activi^ 

"  one  still  has  to  consider  a  number  of  possible  modifications,  select 
some ,  possibly  backtrack,  etc. 

~  Thus  memory  does  a  "best  match"  retrieving,  while  deliberation 
does  additional  problem  solving 

-  We  need  task-based  theories  of  indexing 

~  how  do  we  index  cases?  What  is  the  connection  between  kinds  of 
tasks  and  the  kinds  of  indices?  Some  work  has  been  done 
in  this  connection  on  planning  (Hammond),  diagnosis  (Kolodner), 
and  design  (Goel ). 


Laboratory  Ibr  Al  Research,  The  Ohio  Slate  Universiiy 


lASKS 

1.  TYPES  OF  INFORMATION  PROCESSING  PROBLEMS 
AND  HOW  APPROPRIATE  Al  IS  FOR  THEIR  SOLUTIONS 

2.  TYPES  OF  Al  ARCHITECTURES  THAT  ARE  UNDER  DISCUSSION  IN  Al 

3.  TASKS,  METHODS  AND  KNOWLEDGE 

4.  ARCHITECTURES  FOR  KNOWLEDGE-BASED  SYSTEMS 


Laboratory  tor  Al  Research,  The  Ohio  State  University 
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Knowkdg?  Lesrl  Y$  Sv-nibol  in  AI 


*  Newell's  Knowledge  Level  Vs  Symbol  Level  Distinction 

Too  much  of  the  discussion  in  Knowledge-Based  System 
is  at  the  Symbol  Level,  i.e.,  at  the  level  of  implementation 
details  that  obscure  the  structure  of  the  task  that  is  being 
solved. 

*  The  language  of  tasks,  and  what  it  takes  to  accomplish  them 
(methods)  is  useftil  to  make  sure  that  die  mechanisms 

fit  the  requirements  of  the  task. 


Laboranxy  (or  AI  RMoardi,  Ih*  Ohio  Sato  Un^orsity 


Phenomena  at  tlie  Symbol  Level 


*  Rules 

*  Frames 

*  Connectionist  networks 

*  ATMS 

*  Nexpert 

*  Kee 

*  Problem  Spaces 


*A11  of  these  make  some  degree  of 
commitments  to  an 
implementation  formalism 


*They  all  have  a  place,  butaherthe 
structure  of  the  task  is  delineated. 


*Often,  how  to  organize 

knowledgeand  what  kinds  of 
knowledge  are  needed  are 
obscured  by  an  overemphasis  on 
the  symbol  level 


Laboratory  lor  AI  Resoarch,  Tho  Ohio  Saa  University 
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Pn)bleiTJs  \vith  Traditional  Picture  of  KBS's 


Knowledge  separated  from  use 

No  theoiy  of  how  the  task 
Constrains  the  type  of 
knowledge  in  the  KB  and  the  type 
of  task-l^el  oonlrol  needed 

Often,  task-level  control  is 
embedded  in  the  rules  and 
invisible.  Instead  of  control 
"emerging,”  it  is  simply 
hidden  and  obscured.. 


Latxmaxy  tor  Al  Research,  the  Ohio  State  University 


Myoin  as  Heuristic  Oassifier 


Claiicey  analyzed  the  diagnostic  part  of  Mydn  aid  showed  that  it  was 
performing  a  type  of  p^lem  solving  c^ed  Hemistic  Ctassiflcaitiaii. 


*  The  Rule  system  is  merely  an  implementation  medium  for  the  above  inference  structure. 

*  This  structure  was  implicit  and  hidden  in  the  rules  -  had  to  be  brought  out  for  explanation. 

Laboratory  tor  At  Research,  The  Ohio  Stats  Unkrerahy 
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R1  Asa  Subtasker 


Similariy,  the  R1  System  decomposed  the  design  problem  into 
a  series  of  subtasks. 

The  subtask  structure  was  implemented  in  OPS-S,  but  was  implicit. 

This  led  McDermott  and  associates  to  study  how  to  make 

*ttse*  the  constraints  of  the  task  for  structuring  the  problem  Solving 
activity  and  also  use  those  constraints  for  knowledge  acquisition. 
He  launched  a  snidy  of  the  *iDles*  of  knowledge  in  various  tasks. 


Laboratory  (or  At  Research,  The  Ohio  Stale  Univaraity 


Coni  piled  and  Deep  Mtxlels 


Laboratory  Ibr  A)  Reaeareh,  The  Ohio  State  Univenity 
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Tasks,  MeilKxJs  and  Knowledge 


U0thod:  Characterized  by  Operators  operating  on  Objects 

Plus  krtowledge  (Declarative  or  embedded)  about  how  to 
organize  operator  application  to  satisfy  goal 

Different  methods  may  call  for  different  types  of  knowledge 


LaboraiaryfarAIRMMreh,  Ih*  Ohio  Sni»  Un^aniiy 


Exainples  of  Methods 


Examples: 

1 .  Multiply  two  multi-digit  numbers 


■Logarithmic*  Method 


Extract  Logarithms  of  input  numbers 
Add  the  two  logarithms 
Extract  anti-logarithm 
Operators  underlined 
Objects:  Input  numbers 

Various  results  of  operator  appiicatbn 


labotaiBfy  ter  Al  ninwch.  Iho  Ohto  SOM  UrWamiy 


Methods  Set  Up  Siibtasks 

Subtasks:  Conditions  for  operator  appiicatnn  may  not  be  satsfied 

1.  Operator  not  primitive 

e.g.  ’Extract  logarithm*  may  require  setting  up  as  subtask 

2.  Objects  needed  may  be  missing 
e.g. 


Task: 

Diagnosis 


Operators;  Establish  Hypothesis 
Refine  Hypothesis 

Objects:  Diagnostic  Hypotheses 

Some  of  the  objects  may  be  missing,  or  operator  preconditions  may  not 
be  satisfied  Laboratory  tor  Al  naaaateft.  Tha  Ohio  Staia  Umrartoiy 


Method:  Hierarchical 
Ciassfication 


"IDccp"  Mcxleh  Used  to  Generate  Knowledge  Needed  ftTr 
Objects  and  Opcnifors  of  a  Method 


If  these  objects  not  directly  available,  set  up  subtask  of 
generation  of  diagnostic  hypotheses 


’Deep' Model  also  useful  for  operator  appKcat'ion:  ’Estabfeh’ 
Hypothesis  (simulation  of  device). 

’POINT;  Depth  of  knowledge  is  relallveiy  to  a  method  tor  a  task. 

Tha  Ohio  SMtUntoareiy 
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Types  of  Methcxis 


TypM  of  methods 

;  Well-defined  ’closed-form’ algoridims  (tractable) 

:  So-Called  ’Problem-solvmg’ methods 

Simon's  Classificatjon  of  Problem-solving  Methods  in  Al 
-Problem-Spsrce  Search  Methods 
-’Reasoning’  Methods 
-  ParalM,  constraint-satisfaction  Med)ods 

Newell  ;  Algorithms  Special  Cases  of  Search  Methods,  where 

there  are  no  choice  points,  the  selections  have  already  been  made 
when  the  algorithm  was  composed. 

*  No  finite,  mutually  excl.  set  of  methods 

Laboratory  tor  Al  Roaaareh,  Tha  Ohio  Stata  Univartity 


A  McliitXj  Caii  Be  a  Fn>blem  Space:  'Hie  HC  and  MDX  Examples 

In  Neweirs  Problem-Space  Framework,  each  problem-space  set  up 
for  a  goal  is  a  method.  A  method  can  also  be  a  name  given  to  a 
sequence  of  pre-compiled  problem-spaces. 


Goal 


Problem- 
Space  1 


Subgoal  Problem 


Problem 

Spaces 


Difft  Methods  call  for  Dif.  Types  of  Knowledge. 

Heuristic  Classification  (Qancey);  MOX  (Chandrasekaran,  et  al) 
PS1 :  Space  of  Hypotheses 
PS2;  Space  of  Credibility  Assignments 

PS3:  Space  for  Data  Abstraction 

Laborsionf  tor  Al  nonwch,  lb*  Ohto  Sim*  Urtivorfliy 


A-14 


The  I>esign  Task 


Complete  specifications  of  a  set  of  'primitive*  components  and 
their  relations  (*oonnections*)  so  as  to  meet  sat  of  functions  or 
goals  while  satisfying  a  number  of  specified  constraints. 

-Domain-independent  character  of  Design 

(planning,  programming,  mechanical  design, ...) 

-  Design  not  unitary  process 

~  Repertoire  of  strategies  each  using  specific  forms  of  knowledge  & 
inferences 

-  Differences  in  design  process  in  different 

domains  function  of  available  knowledge 


Laboratofy  (or  Al  RaaMreh,  The  Ohio  State  Univaraity 


'I'he  PVCM  Method  Family 


Design 

Task 


Propose  Design 
Critique  &  modify 
Family  of  Methods 


Numerous  Ways  in  which  subtasks  can  be  set 
up  &  invok^  &  combined  ^  > 

Laboratory  lor  Al  RoMarch,  The  Ohio  Sta»  UnIvanIty 
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The  PVtrM  Methtui  IVopose  MeUiods 


Design 

Task 


Propose  Design 
Critique  &  modify 
Family  of  Methods 


Laboratory  for  Al  Research,  the  Ohio  State  University 


Methods  for  Proposing  Design  Choices 


*  Design  Problem  Decomposition/Solution 

composition 

-  Design  plans  a  special  case 

*  Retrieval  of  cases  from  memory  for 

similar  problems 

*  Families  of  Optimization/Constraint 

satisfaction  methods 


Laboratory  for  Al  Research,  the  Ohio  Stats  Univeraiiy 


A-16 


Deconi  position  Methoiis 


Knowledge  Needed 

D - >D1.D2....Dn 

Di's  *smaller  problems  (smaller  search  spaces) 

-  if  Altemate  Decompositions  possible,  then  search  in 
decomposition  problem  space  takes  place 

*  Repeated  applications  result  in 

design  hierarchies,  no  search 

Subtasks 

*  Translating  from  specification  of  D  to  specifications 

*  Composing  subproblem  solutions  into  problem  solutbns 

Laboratory  tor  Al  Research.  The  Ohio  State  University 


Decorn position.  Con dnueti 


Information  Needed: 

-  How  goals/constraints  of  D  get  translated 

into  corresponding  ones  for  Dfs. 

-  How  to  glue  solutions  back 

-  May  be  given  as  part  of  decomposition  knowledge  or  as 

auxiliary  problem  solving 

-  CRITTER  System  (Kelly)  is  a  domain-specific  simulation  facility 

for  generating  constraints  for  Di's  &  gluing  them  back 


Laboratory  lor  Al  Research,  The  Ohio  State  University 


Task  decomjK>sitiou  ckxjs  not  spet'ify  conlroi  in  (.kiail 


Subprobiem  constraint  generation  may  involve  alternating 
between  design  proposal  &  constraint  generation 

-  propose  &  revise  method  (McDermott  and  Marcus) 

-  progressive  deepening  (Steier's  work  on  algorithm  synthesis) 

In  configuration  tasks,  subproblem  solution  already  given, 
problem  dominated  by  specification  generatbn 
&  solution  composition 


Labonlory  for  At  Research,  The  Ohto  Slate  University 


Design  Plans 

-  Sequence  of  design  actions 

-  Precompiled  partial  solution 

-  Recursive  application  of  plans 

~  Incorporates  decomposition  knowledge 
~  Indexed  by  functions  or  components,  multiple  plans  possible 

-  Inference:  instantiate  &  expand  (e.g.  Noah) 

-  Dependencies  bet.  parts  of  plan  may  be  discovered  or  pre-compiled 

Laboraiary  for  Al  Ratsarch,  Tha  Ohio  Slats  Univaraity 
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Ubiquir.'  of  Idea  of  Plans  in  Al 


Millar,  Galanter  &  Pribram:  Almost  all  knowledge  Is  plans 
C.  Rich  :  Programmer's  Apprentice 

Soloway  and  Johnson;  Programming  expertise  is  accumulation  of  plans 
Perry  Miller,  et  al..  Plans  for  Therapy  Planning  and  Critiquing 
Schank  &  Abelson,  Plans  for  Almost  Everything 
Friedland,  StefSc :  Molgen  ••  Planning  of  Genetics  Experiments 
Brown  &  Chandrasekaran:  Design  expertise  as  design  plans 
Mittal:  Design  expertise  as  design  plans 


Laboratory  tor  At  ReseareT),  The  Otiio  State  Univarsity 


l!)e.siart  Cases  For  Dersign  Prf.>{,H>sal 

METHOD  3  FOR  DESIGN  PROPOSING: 

-  Cases  (Sussman,  Schank) 

"All  Design  is  Redesign' 

'Critique  &  Modify  Almost  Correct  Design^" 

Important  Issues: 

~  Matching:  goal  priority,  similarity 

~  Critiquir^ 

- Simulation 

- Constraint  Analysis  (Dependency  Analysis) 

-  Modifying 

~  Another  design  problem 


Laboratory  tor  AIRataarcri,  The  Ohio  SiaiB  Univoraity 
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Ilie  PVCM  Meth<>d  Family-  Verily  Methods 


Design 

Task 


Propose  Design 
Critique  &  modify 
Family  of  Methods 


Laboratory  tor  Al  RM«arch,  Tba  Ohto  Stata  Univartity 


Methoiis  for  Verification 

Methods  for  Verification 

*  Direct  Calculation 

(Finite  Element,  e.g.) 

*  Simulation 

-  Quantitative 

-  Qualitative  Simulation 

~  Visual  (Perceptual)  Simulations 

Laboratory  tor  Al  Raaaarch,  Tha  Ohio  Stala  UnIvartiV 

A-20 


PVCM  Method  Family-  -  (Mticjite  Methais 


Design 

Task 


Propose  Design 
Critique  &  modify 
Family  of  Methods 


Laboratory  (or  Al  Research,  The  Ohio  State  Univertity 


Cxiiiquing  Methods 


Methods  for  Critiquing; 

*  A  generalized  version  of  the  diagnostic  problem 

*  Dependency  Analysis  (Sussman) 

*  Functional  Analysis  of  Proposed  Design  (Goel) 

-  If  design  proposal  is  endowed  with  causal  indices  that 
explicitly  indicate  relation  bet  structure  &  function,  substructures 
for  modificat'ion  can  be  identified 

. ? 

Laboratory  tor  Al  Resaarch.  Tha  Ohio  Staia  Univarsity 


'rhe  PVCM  Metl>od  Family-  -  ModitlcatioR  Merhods 


Design 

Task 


Propose  Design 
Critique  &  modify 
Family  of  Methods 


Laboratory  (or  Al  Research,  The  Ohio  State  Univaraity 


Modificarion  Methods 


Modification 

*  Means-ends  like  methods 

*  hill-climbing  for  parametric  (problems) 

*  If  criticsm  identified  missing  functions 

for  which  separate  components  should  be  added, 
modification  can  be  posed  as  a  new  design  problem 


Laboratory  (or  Al  neieerch.  The  Ohio  State  Univoraity 
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Hie  Task  Structure  As  An  Organizing  Idea 


Task  Analysis  in  the  form  of  task  structure  provides  dear 
The  Task  Structure  roadmap  for  knowledge  acquisition 

Task 


Subtasks 

’Primitive*  Subtasks 

LabooiKMy  tor  Al  Fteteareh.  Th*  Oito  State  Univarsity 
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CriierSa  f{>r  Choice  of  Methods 

-  Properties  of  Solution  (Oualitative/Quantitative),  degree  of  accuracy..) 
•  Propeties  of  Solution  Process  (Computational  properties, ...) 

-  Availability  of  Knowledge  in  the  Domain, 
to  Support  Method,  or  Knowledge 

to  Generate  Method-Specific  Knowledge 

•  No  one  ideal  method  for  design 

•  Not  NORMATIVE,  rather  language 

to  help  us  describe  domains  (Ctancey:  Knowledge  Bases  as  Domain 
Models) 


Laboratory  ter  Al  Research,  The  Ohto  Stale  Univeratiy 


Addidonal  Proj^erties  of  the  Task  Structure  View 


-  Explains  how  real  human 
expertise  comes  about  in  a 

tractable  way :  Not  by  access  to  a  UNITARY  algorithm 
for  a  complex  ta^,  by  accumulation  of  Knowledge  to 
decompose  tasks,  and  a  repertoire  of  methods  which 
match  tasks  and  the  domain  environment 

-  Integration  of  different  types  of  methods: 

Qualitative  and  quantitative 


Laboratory  tor  Al  Rasaarch,  Tha  Ohto  Statt  Univartlty 
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KBS  ARCIIITECTDRES 


1 .  TYPES  OF  INFORMATION  PROCESSING  PROBLEMS 
AND  HOW  APPROPRIATE  Al  IS  FOR  THEIR  SOLUTIONS 

2.  TYPES  OF  Al  ARCHITECTURES  THAT  ARE  UNDER  DISCUSSION  IN  Al 

3.  TASKS,  METHODS  AND  KNOWLEDGE 

4.  ARCHITECTURES  FOR  KNOWLEDGE-BASED  SYSTEMS 


Laboratory  tar  Al  ResMTCh.  Tha  Ohio  Stata  Univeraity 


Implications  tor  An  Afcbiteciurc  for  I^'oblent  Solving 


1.  Method  specific  shells,  pre-combined  (GTs,  TSA's) 

2.  Method-specific  shells,  to  be  dynamically  chosen 
&  combined  (TIP  work  of  Punch,  BB1, ..) 

3.  Methods  may  be  too  much  like  a  *big-switch' 

~  Use  methods  as  spedfication  of  a  set  of  goals,  but 
instead  of  scheduling  a  method  as  a  untt,  use  a  finer-grained 
goal  schooling  archKecture 

-  Implement  the  search-like  methods  in  '^ar'-like  architecture 

-  Todd  Johnson's  work  on  implementing  GPs  in  SOAR 

Laboratory  tar  Al  Raaaareh,  tha  Ohio  Stala  Unlvareiy 
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ARCHITECTURAL  IMPLICATIONS 
1.  Method-specific  shells,  pre-combined  (GT's,  TSA's) 


Laboratary  for  Al  Research,  The  Ohio  Stale  University 


MORE 

MORE  is  a  tool  for  acquiring  and  using  diagnostic  knowledge. 

Authors:  Gary  Kahn,  Steve  Nowlan  and  John  McDennott 
Camegie-Mellon  University 
Forerunner:  MUD  expert  system 

Task  Specification: 

Given  observations  about  symptoms,  determine  the  hypotheses. 
Kind  of  Knowledge: 

Hypotheses,  Symptoms  caused  by  hypotheses.  Tests  for  symptoms, 
and  Conditions  affecting  die  strength  of  links  between  entities. 
Measure  of  belief  and  disbelief  indicate  link  strength  (-10  to  -t-IO) 
which  can  vary  depending  on  conditions. 

Control:  Evaluate  hypotheses  by  treating  weights  like  EMYCIN 

certainty  factors. 

Special  Features: 

MORE  actively  elicits  knowledge 


labMiory  lor  Al  RoMorch,  Tho  Ohio  Staio  UnlvorSty 


CSRL 

CSRL  (Conceptual  Structures  Representation  Language)  is  a  tool  for 
hierarchies  classification,  which  is  searching  for  hypotheses  in  a 
classification  hierarchy. 

Authors:  Tom  Bylander  and  Sanjay  Mtttal 

The  Ohio  State  University  and  Xerox  Parc 

Forerunner:  MOX  expert  system 

Task  Specification: 

Find  the  catemries  or  hypotheses  that  apply  to  the  situation  being 
analyzed. 

Kind  of  Knowledge:  Hypotheses  and  associations  between  patterns  of 
data  and  confidence  in  hypothesis. 

Organization  of  Knowledge: 

-  Hypotheses  are  organized  as  a  classification  tree. 

Control: 

-  Top-down  multiple  path  search  of  the  classification 
hierarchy  (establish-refine). 

-  When  a  hypotheses  is  invoked,  its  knowledge  groups  are 
used  to  determine  its  confideiKe  value. 

Laboratory  for  Al  Raaaarch,  The  Ghio  Staia  Univarai^ 


HYPER 

HYPER  (Hypothesis  Matcher)  is  a  tool  for  hypothesis  matching,  which 
determines  the  fit  of  a  hypothesis  to  a  situation. 

Authors:  Todd  R.  Johnson,  Jack  W..Smith  &  Tom  Bylander 

The  Ohio  State  University 

Forerunner:  CSRL  tool 

Task  specification: 

Given  a  set  of  data,  determinmg  the  confidence  value  of  some 
hypothesis. 

Kind  of  Knowledge: 

Decisbn  tables,  called  knowledge  groups,  that  associate 
patterns  of  input  with  confidence  values. 

Knowledge  Organization: 

-  A  hierarchical  organization  of  knowledge  groups. 

-  Each  knowledge  group  specifies  how  selected  data  and  the 
results  of  its  children  are  mapped  to  a  confidence  value. 

-  The  root  knowledge  group  makes  the  overall  decision. 


LaboraioryforAIRMMrch,  Tb*  Ohfo  Stut  Uni¥«riity 


A-27 


IDABLE 


IDABLE  (Intelligent  Data  Base  Langu^e)  is  a  tool  for  knowledge-directed 
information  passing,  which  is  inferring  attributes  of  a  data  concept  from 
conceptually  related  data. 

Forerunner:  PATREC  subsystem  of  MOX  expect  system. 

Author:  Jon  Sticklen,  The  Ohio  State  University 

Task  Specification: 

Given  a  datum  to  retrieve,  infer  it  from  a  conceptually  related  datum. 
Kind  of  Knowledge: 

-  Data  concepts,  their  attributes  and  valid  values,  and  relations 
between  data  concepts. 

•  Inferences  for  mapping  one  datum  to  another  (along  some 
relationship). 

-  Temporal  relationships  between  data 
Knowledge  Organlaatlon: 

-  Data  concepts  are  organized  in  a  type  hierarchy. 

-  Inferences  are  attached  to  the  attribute  to  be  inferred. 

-  Case  data  is  temporally  organized  under  events  and  temporal 
relationships  to  events. 

Laboniory  tor  Al  RMMrch,  'Hw  Ohio  Saw  Univaniiy 


PEIRCE 

PEIRCE  is  a  tool  for  abductive  assembly,  which  is  finding  the  best  explanation 

for  a  set  of  data. 

Authors:  William  F.  Punch  Michael  C.  Tanner  &  John  Josephson 

The  Ohio  State  University 

Forerunner:  RED  expert  system 

Task  Specification: 

Given  a  set  of  data  to  explain  and  a  set  of  hypotheses  to  explain 
the  data,  find  the  best  composite  hypothesis  that  explains  the 
data. 

Kind  of  Knowledge: 

Plausability  of  hypotheses  and  the  data  that  each  hypothesis 
explains. 

Knowledge  Organization: 

Assemblers  indicate  what  subtask  to  do  next. 

Laboratory  tor  Al  RMMrch,  Tha  Ohto  Staia  Univartliy 
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HERACLES 


HERACLES  (Heuristic  Classification  Shell)  is  a  tool  for  performing  diagnosis 
by  heuristic  classification. 

Author:  William  J.  Clancey.  Stanford  University 

Forerunners:  MYCIN  expert  syste,NEOMYCIN  expert  system 

Task  Specification: 

Given  a  set  of  data,  determine  a  classification;  HERACLES  is 
conceptualized  as  a  diagnosis  tool. 

Kind  of  Knowledge: 

Findings,  Hypotheses,  and  Relations  between  them  (generalizations, 
taxonomic,  causal,  associational). 

Knowledge  Organization: 

-  Hypotheses  are  organized  as  a  classification  hierarchy. 

•  EMYCIN  production  rules  for  determining  confidence  in  hypostheses. 

-  Causal  structures.  Generalization  structures  for  findings,  etc. 


Laboratory  tor  Al  R«March,  the  Ohio  SBto  Univeraty 


HERACLES,  coviD 


Control: 


A  structured  collection  of  task  procedures  (each  task  is  implemented  by 
metarules)  including; 


-  Generate-questions  (probe  for  additional  information  to 
suggest  new  hypotheses). 


•  Group-and-differentiate  (focus  on  a  general  process  consistent 
with  hypotheses). 


-  Explore-and-refine  (top-down  search  from  some  hypothesis  in 
the  classification  hierarchy). 


-  Process-finding  (trigger  hypotheses  that  explain  a  finding). 


Special  Features: 


HERACLES  is  intended  to  be  a  complete  diagnosis  tool. 

Laboratory  tor  Al  Rotoarch,  Tbo  Ohio  State  Univertity 
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SALT 


SALT  is  a  tool  for  constructing  expert  systems  that  perform 
constraint-satisfaction  tasKS. 

Author:  Sandra  Marcus  and  John  McDermott 

Camegie-Mellon  University 

Task  Specification:  *Propo.se-and-Refine* 

Given  some  parameters  with  specified  values  specified,  assign  values 
to  all  parameters  satisfying  constraints. 

King  of  Knowiadge: 

•  Procedures  for  calculating  values  of  parameters 

-  Constraints  on  parameters 

-  Fixes  for  failed  constraints 
Knowledge  Organization: 

-  Procedures  form  an  aeydic  graph 

-  A  fix  revises  a  parameter  used  to  calculate  the  incorrect 
parameter 

Laboratory  for  Al  Research,  the  Ohio  Slate  Unkreraity 


Control: 


SALT,C0NT» 


•  Parameter  values  are  calculated  as  soon  as  possible,  i.e.,  when  a 
procedure's  inputs  are  available  and  its  precoiiditions  satisfied. 


•  Constraints  may  be  checked  either  as  soon  as  a  parameter  is 
calculated,  or  after  all  parameters  are  calculated. 


-  if  a  constraint  fails,  alternative  fixes  are  applied  to  previously  calculated 
parameters  until  the  constraint  is  satisfied. 


All  other  values  depending  on  the  fixed  parameter  are  recalculated. 


Special  Features: 


SALT  provides  a  knowledge  acquisition  facility  for  intennewing  experts 
and  helping  them  analyze  the  knowledge  base. 


Laboratory  for  Al  Retoarch,  Tha  Ohio  Siaia  Univoraity 


Generic  Tasks 


*  BUILDING  BLOCKS  FOR  COMPLEX  KNOWLEDGE-BASED 
PROBLEM  SOLVING  TASKS 

•EXAMPLES: 

*  HIERARCHICAL  CLASSIRCATION 

*  SYMBOUC  HYPOTHESIS  ASSESSMENT 

*  ROUTINE  DESIGN  BY  HIERARCHICAL  PLAN  SELECTION 

*  FUNCTIONAL  SIMULATION 

*  ABDUCTIVE  ASSEMBLY 

*  DATA  ABSTRACTION 

*  EACH  IS  A  "GENERIC"  TASK,  WITH  CHARACTERISTIC 
KNOWLEDGE  AND  STRATEGIES 


Laboratory  for  Al  RasMTCh,  The  Ohio  Staia  Univeralty 


Generic  Tasks^  comd 


COMPLEX  TASKS  SUCH  AS  DIAGNOSIS  CAN  BE  PERFORMED 
BY  DECOMPOSING  IT  INTO  GENERIC  TASKS  AND  THEIR 
RESULTS  COMPOSED 


DIAGNOSIS  AS  ABDUCTIVE  TASK 

•—  ABDUCTION  BY  DECOMPOSITION  INTO 

HYPOTHESIS  GENERATKM  MD  ABDUCTIVE  ASSEMBLY 


--  HYPOTHESIS  GENERATION  BY  HIERARCHICAL 

CLASSIFICATION  OVER  A  MALFUNCTION  HIERARCHY 

-  HIERARCHICAL  CLASSIRCATION  USING  HYPOTHESIS 

ASSESSMENT  AS  A  SUBTASK 


Laboratory  for  Al  Roaoarch,  Tho  Ohto  Siaii  Univoniiy 


A- 31 


OSU  LAIR 


GT  TOOLSET  CHARACTERISTICS 


•  Highly  modular  knowledge  structures,  functionally  organized.  Based  on 
cooperating  communities  of  specialized  agents  with  embedded  knowledge. 


•  Tools  are  work  together  for  building  problem  solvers  that  integrate 
different  stypes  of  reasoning. 


•  Explanation  facilities  are  coupled  to  problem-solving  architecture.  The 
course  of  the  problem  solving  makes  sense. 


•  Problem  solvers  can  call  upon  external  software,  and  can  be  embedded. 


Laboratory  for  Al  Research,  The  Ohio  State  Uittvarsity 


ARCHITECTURAL  IMPLICATIONS 
2.  Method-specific  shells,  to  be  dynamically  chosen 
&  combined  (TIP  work  of  ^nch,  BBl, 


Laboratory  for  Al  Research,  The  Ohio  Stale  UnKrarsity 
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TIPS  AND  BB1 


*  CENTRAL  IDEA  IS  A  MECHANISM  THAT  CAN 
INVOKE  DIFFERENT  METHODS  THAT  ARE  RELEVANT 
FOR  THE  CURRENT  GOAL,  EVALUATE  THEM  FOR 
APPROPRIATENESS  FOR  THE  CURRENT  SITUATION. 

AND  CHOOSE  THE  "BESr  METHOD 

*  THIS  IS  DONE  RECURSIVELY.  METHODS  CAN  BE  ABANDONED 
AND  OTHER  METHODS  CHOSEN 


Labomnyy  for  At  Research,  The  Ohio  SUM  Univaraity 


ARCHITECTURAL  IMPLICATIONS 
3.  use  a  finer-grained  goal  scheduling  architecture 


Laboratory  for  At  Research,  The  Ohio  Stale  University 
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Soar 


Soar  (Newell,  et  al.) 

*  The  Knowledge  Base  is  used  to  set  up 

a  Problem  soace  in  response  to  a  goal 

*  Operators  have  to  be  selected  &  applied  to 

search  the  problem  space. 

*  Impasses  can  occur  at  any  stage 

(such  as  operator  missing,  pre  conditions  not  satisfied, ...) 

*  Subgoals  to  resolve  such  impasses  can  be 

set  up  recursively. 

*  uniform  treatment  as  search  in  problem-space 

*  goals  can  be  mixed  from  different  methods 

to  ::chieve  fine-grained  control  structure 

*  Generic  Tasks  in  Soar  (Todd  Johnson,  91) 


Laboratory  ter  AIRMMrch,  ThoOhte  State  Unhraraity 


Relation  of  Tfisk-s/Metbods  Viewptjint 
to  Traditional  GT  Work 

1 .  GTs  (so  far)  are  fixed  task-method-subtask 
combinations  of  very  generic  utility. 

2.  New  approach  retains  basic  insight  of  GTS; 
close  connection  bet  method,  knowledge  & 
inference,  but  generalizes  ideas 

-  Bridge  between  flexible  GP  architectures  & 
methods  that  take  advantage  of  task  structure 


Laboratory  ter  Al  rteioarch,  TIw  Ohio  Slate  Univoriity 
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Librari^rs  of  Tasks.  Methods  and  Knowiedg?; 

What  is  really  going  on  is  that  there  n  an  intematbnal  collection  of 
researchers  devek^ing  precisely  this:  a  libr^  of  tasks  artd  niethods 
for  many  important  complex  tasks,  such  as  di^nosis  and  design. 

Reusable  knowledge  modules,  indexed  by  domain,  method  and  knowledge 
will  soon  bmme  available 

Task:  _ 

Diagnosis 

(de  Kleer,  Reiter) 

•INTRACTABLE* 

Domain  Knowledge  Modeling. 

FORMAL  COMPLEXITY  ANALYSIS  OF  METHODS 

Laboratory  ter  AIRetearm,  Itw  Ohio  State  UnKreralty 
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appendix  b 


TIMING  ISSUES  IN 


REAL-TIME  INFORMATION  SYSTEMS 


Jane  W,  S.  Liu, 

Department  of  Computer  Science 
University  of  Illinois 
Urbana,  Illinois  61801 


Typical  hard  real-time  systems  and  their 
timing  constraints 

Challenges  in  maintaining  timing  constraints 

Traditional  approaches  to  hard  real-time 
scheduling  and  resource  management 

Recent  progresses  and  unsolved  problems 


TERMINOLOGY 


•  Job  or  task: 

—  A  unit  of  work  (a  granule  of  computation,  or 
data  transmission,  etc.) 

•  Hard  real-time  task: 

—  A  task  whose  failure  to  complete  in  time  is 
considered  to  be  a  fatal  error. 

•  Soft  real-time,  essential  task: 

—  A  task  whose  result  becomes  less  and  less 
useful  after  its  deadline 

—  Examples:  display  update,  operator  commands 
and  non-critical  monitoring. 

•  Soft  real-time,  non-essential  task 

—  A  task  that  may  be  aborted  after  its  deadline. 

—  Examples:  connection  establishment,  input/output 
requests. 
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Control  Hierarchy 


Optimal 

control 

(mln-hr) 


Typical  computational  problems 


Direct 

computer 

control 

(msec) 


Time-optimal  problems 
Fuel-optimal  problems 
Fuel-optimal  rendezvous  problems 
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An  Example 


r - 1 
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Rigid  timing  constraints  and  exact 
computations  —  hard  real-time 

Flexible  timing  constraints  and  exact 
computations  —  soft  real-time 

Rigid  timing  constraints  but  flexible 
computations  —  imprecise  computations 


Typical  radar  processing  system 


memory 


sampled 


digitized  data 


256  - 1024 
samples  /  bin 


control  status 


. . 


track 


records 


track 

records 


signal 

processor 


1" .  1 

1 

Pipeline 

Processors 

- 

(with  local  memory) 

. i' . 


control 

status 

program 


P 


1 

1 

1 

1 

1 

1 

1 

1 

signal 

1 

1 

signal 

1 

processing 

1 

processing 

[  Ti 

The  Traditional  Periodic  Job  Model 

(The  simplest  scheduling  problem) 

•  The  work  to  be  scheduled  on  each  processor  is  a  set 
of  jobs: 

—  Each  job  is  a  periodic  sequence  of  requests 
for  the  same  computation,  called  tasks. 

—  The  period  of  a  job  is  the  length  of  the  interval 
between  arrivals  of  two  consecutive  tasks. 

—  The  execution  time  of  the  tasks  in  each  job 
is  given. 

—  The  tasks  are  to  be  scheduled  preemptively. 

—  Each  task  must  be  completed  before  the  next 
task  becomes  ready. 

•  The  problem: 

Find  a  feasible  schedule  in  which  every  task 
completes  before  its  deadline. 
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A  cyclic  executive  is  a  program  that 
deterministically  interleaves  the  execution 
of  periodic  tasks  on  a  single  processor. 

Example 

A  =  (10,1  ) 

B  =  (10,3) 

C  =  (15.2) 

D  =  (30,8)  =>  (30,3)  (30,5) 


Priority-Driven  Algorithms 


They  are  scheduling  algorithms  that  never  leave  the 
processor(s)  idle  intentionally. 

Such  an  algorithm  can  be  implemented  as  follows: 

—  Assign  priorities  to  tasks. 

—  If  preemptive,  scheduling  decisions  are  made 

o  when  any  task  becomes  ready, 
o  when  any  task  completes, 
o  when  the  task  priorities  change 

—  At  each  decision  time,  schedule  and  execute 
the  ready  task  with  the  highest  priority. 

A  priority-driven  algorithm  is 

—  static  if  priorities  are  assigned  to  jobs  once 
and  for  all,  and  is 

—  dynamic,  if  priorities  of  tasks  may  change. 


2 


14 


Non-preemptive  EDF,  FBP'O 


missed 

deadline 


Preemptive  EDF 


I  ^2  I  Tj 

3  4 


Non— preemptive,  not  priority— driven 


3  A  4 


intentional 
idle  time 
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Two  Classical  Algorithms 


•  Rate  monotone 

—  higher  priorities  are  assigned  to  jobs  with 
shorter  periods 

—  optimal  among  all  static  priority-driven 
algorithms 

•  Earliest  deadline  first 

—  the  highest  priority  is  assigned  to  a  task 
with  the  earliest  deadline. 

—  optimal  among  all  dynamic  priority-driven 
algorithm 


B-ll 


An  example:  schedule  (2, 0.9)  (5,  2.3)  on  one  processor 


•  Rate  monotone 


•  Earliest  deadline  first 


i - 

1 - 

■i 

\ — — 

< - 

•  Schedule  (2, 1)  and  (5,  2.5)  by  Rate  monotone 


t 


missed  deadline 


•  According  to  Earliest  deadline  first 


•  Schedulability  Criteria 

—  The  total  utilization  t/  of  a  set  of  job  = 

the  fraction  of  time  it  keeps  a  processor  busy. 

—  The  set  is  schedulable  using 

.  the  earliest  deadline  first  if  C/  <1, 

•  the  rate  monotone  if  (/  <  0.82  —  0.63. 
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Undesired  behavior  of  the  earliest  deadline  first  algorithm 
•  Schedule  (2, 1)  and  (5, 3)  with  U  =  1.1 


•  Schedule  (2, 0.8)  and  (5, 3.5)  with  £/  =  !.! 


■ 

■ 

1 

1— 

■IH 

As  U  increases  which  deadlines  will 
be  missed  cannot  be  predicted. 
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0 


2 


4 


6 


8  10  12  14  16  18  20  22  24  26 

Ti  =  (  8,  5  ),  Tj  =  (  22,  7  ),  Ta  =  (  26,  4.5  ) 


We  have  methods  to  predict  such  behavior 
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State  of  the  Art  in 

Scheduling  and  Resource  Management 

To  support  time-critical  applications,  scheduling 
algorithms  should 

—  guarantee  enforcement  of  timing  constraints 

—  have  predictable  behavior  under  dynamic  loads 

—  make  effective  use  of  system  resources 

We  have  optimal  and  good  heuristic  algorithms  for 
traditional  applications:  tasks  to  be  scheduled  are 

—  periodic  and  have  bounded  processing  times,  or 

—  aperiodic  with  bounded  interarrival  times. 

We  need  good  heuristics  for  scheduling 

—  tasks  that  have  time-varying  and  data-dependent 
processing  times  and  resource  requirements, 

—  tasks  with  end-to-end  timing  constraints  on 
multiprocessor  and  distributed  systems,  and 

—  tasks  that  interact  frequently. 


We  need  integrated  strategies 


Two  Challenging  Problems  in 


Inteliigent,  Time-Critical  Information  Systems 


•  How  to  keep  information  temporally 
consistent 

•  How  to  deal  with  unpredictability  in  time 
and  resource  requirements 
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Temporal  Consistency 


A  set  of  data  objects  gives  the  state  of  the 
real  world. 

•  Relative  temporal  consistency: 

The  data  objects  represent  a  snapshot. 

•  Absolute  temporal  consistency: 

The  snapshot  is  sufficiently  current. 
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Data  Objects 


Images  which  store  sampled  values 
of  real-world  objects 

In  the  ith  period  after  image  a:  was  last 
updated,  age  of  a:  =  /  +  1. 

Derived  objects  whose  values  are 
computed 

If  the  derived  object  y  is  computed  from 
{ xi,  yj},  age  of  y  =  max  { ages  of  xi  and  yj} 
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Relative  Temporal  Consistency 


■ 

» 

_ 1 _ 1 _ 1 _ 

■ 

» 

_ 1 _ 1 _ 1 _ 

■ 

_ 1 _ 1 _ 1 _ 

0  5  10  15 

l«W 

B(») 

preempted 

_ 1 _ 1__ 

_ ^ _ 

_ 1 _ 

0  4  11 

•  The  values  of  x  and  y  read  by  T2  may  be  temporally  dispersed. 


Absolute  Temporal  Consistency 


W{ 

» 

_ 1 _ 1 _ 1 _ 

mtes 

_ 1 _ 1 _ 1 _ 1 _ 

» 

_ 1 _ 1 _ 1 _ 

0  5  10  15 

«{*) 

^2  1 _ 1 _ 1  1 

_ 1 _ 1 _ 1 _ 1 _ 

_ 1 _ 

0  4  11 


•  The  values  of  x  read  by  may  be  obsolete. 
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The  Imprecise  Computation  Model 


I  I  A  task,  a  granule  of  computation 
— Dependency  between  tasks 


Monotone  Imprecise  Computations 


Error 


time 


mandatory 
P  subtask 


optional 

subtask 


error  recovery  mandatory 


error  recovery  optional 


Precise  Results  Vs  Imprecise  Results 
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The  called  procedure 


newton(xguess,  xdistance,  acceptable_distaiice,  desired_distance) 

{ 

/*  mandatory  part  */ 

repeat  { 

xnew  =  xguess  -  f(xguess)/fprime(xguess); 
xdistance  =  xnew  -  xguess; 
xguess  =  xnew; 

}  while  (abs(xdistance)  >  acceptablejdistance); 
returnjmprecisejreault  (xnew,  xdistance); 

/*  optional  part  */ 

repeat  { 

xnew  =  xguess  -  f(xguess)/fprime(xguess); 
xdistance  =  xnew  -  xguess; 
return^mprecisejresult  (xnew,  xdistance); 
xguess  =  xnew; 

}  while  (abs(xdistance)  >  desiredjdistance); 
return^reeise^esult  (xnew,  xdistance); 


The  calling  procedure 


guess  =  iiiitial_value,* 
imprecisejresult  (newton,  handle); 

newton(g\iesSf  distance,  acceptable_range,  desired_range); 
if  (distance  <  desired^range) 

/*  a  precise  result  returned  */ 
else  *y  (distance  <  acceptable^range) 

/*  an  acceptable  imprecise  result  returned  */ 
else  /*  the  returned  result  not  usable  */ 


} 
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Suitable  Methods  and  Applications 


•  Iterative  algorithms,  accumulation  problems 

•  Statistical  methods,  e.g.  Monte  Carlo  method 

•  Approximate  relational  algebra  query  computation 

•  Successive  doubling  method  for  computing  FFTs 

•  Generation  of  images  from  holograms 

•  Transmission  of  compressed  digitized  voice 

•  Transmission  of  facsimile  images 
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Application  in  the  Database  Domain 


Work  in  progress 

—  Monotone  query  processing  when  the 
database  contains  complete  information 
and  queries  are  precise. 

Future  work 

—  Monotone  query  processing  in  the 
presence  of  imprecise  or  incomplete 
information 


Imprecise  updates 


1 


Where  can  flight  United  941  land? 


The  Exact  Relation 


location 

Appro.  Dist, 

Chi-O’Hare,  IL 

350 

Manchester,  NH 

250 

Peoria,  IL 

400 

Syracuse,  NY 

100 

An  Approximate  Relation 


Location 

Appro.  Dist. 

Chi-O’Hare,  IL 

350 

Peoria,  IL 

400 

Paris,  IL 

380 

Hell,  MI 

330 

Manchester,  NH 

250 

Berlin,  MA 

300 

Syracuse,  NY 

100 

Certain  tuples 


Possible  tuples 


B-27 


Location 

Chi-0*Hare,  EL 

350 

Peoria,  IL 

400 

Syracuse,  NY 

100 

Paris,  EL 

380 

Hell,  MI 

330 

Manchester,  NH 

250 

Berlin,  MA 

300 

Location 

Chi-O’Hare,  EL 

300 

Peoria,  IL 

400 

Paris,  EL 
Manchester,  NH 

Berlin,  MA 

300 

Syracuse,  NY 

100 

Location 

Appro.  Dist. 

Chi-O’Hare,  IL 

300 

Peoria,  IL 

450 

Paris,  IL 

380 

Hell,  MI 

330 

Manchester,  NH 

250 

Berlin,  MA 

300 

Syracuse,  NY 

100 
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Elements  of  the 

Approximate  Relational  Model 


A  set  of  all  approximate  relations  of  any 
standard  relation  —  approximate  answers 
to  a  relational  query 

A  partial  order  relation  over  the  set  for 
comparing  them 

Monotone  approximate  relational  algebra 
operations 

A  monotone  query  processing  algorithm  for 
returning  monotonically  improving  answers 


APPROXIMATE 

{A  Prototype  Monotone  Query  Processor) 


•  Supports  processing  of  relational  algebra  queries  on 
relational  database  systems 

•  Uses  an  objected-oriented  approach  to  implementation: 

—  relies  on  an  object-oriented  view  for  semantic 
support  in  identification  of  initial  approximations 

—  avoids  operations  on  possible  tuples 

—  provides  lazy  evaluation  of  possible  classes  upon 
user  interrupt  or  faults 

•  Improves  data  availability  and  query  processing  time 
predictability 
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Elements  of  An  Architectural  Framework 


•  Process  Structure 

Client  Server 


Scheduler 


Result  Saving  Protocol  —  for  the  provision  of  monotonically 
improving  imprecise  results 

Imprecise  Service  Establishment  Protocol  —  governing  the 
interactions  among  the  application  tasks,  as  well  as  among 
the  application  tasks  and  the  underlying  support  system 

Imprecision  Management  Policies  and  Mechanisms  —  to 
ensure  the  correct  usage  of  imprecise  results 


Imprecise  Service  Establishment 


Client  Server  Scheduler 
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Periodic  Job  Models 


•  Precise  periodic  job  set  {Jk} 

—  Jk  is  specified  by  (pk^'^k) 

Pk  =  period  length. 

Zk  =  execution  time 
ready  times  and  deadlines 


•  Imprecise  periodic  job  sets  {Mk}  and  {Ok} 
—  =  minimum  execution  time 

—  Given  Jk  -(pk.'^k)  there  is 

o  a  mandatory  job  Mk  =(pk^f^k) 
o  an  optional  job  Ok -ipky  '^k  ) 


mandatory 


optional 


* - mk 


Zk  ~’mk  — ^ 
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{Jk}:  (2,1)  (4,0.5)  (5,0.5)  (6,1.5) 


0  2  4  6  8  10  12  14  16 


missed  deadline 


A  traditional  rate-monotone  schedule 
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Types  of  Jobs 


•  Error  non-cumulative  jobs 

—  optional  parts  need  not  be  completed 

—  the  average  effect  of  error  is  observable 

—  example:  image  enhancement 

•  Error  cumulative  jobs 

—  optional  part  must  be  completed  occasionally 

—  errors  in  different  periods  have  cumulative 
effects 

~  example:  target  tracking 
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Performance  Measure  for  Scheduling 
Error— noncumulative  Jobs 


Error  Functions 


Average  error  of  job  over  p  periods 
p  ==  least  common  multiple  of  all  periods 

Average  error  over  K  jobs  in  J 


Relative  Merits  of  Different  Algorithms 


U  <  1 

u>\ 

Classical  scheduling 

Our  scheduling 

algorithms: 

algorithms: 

Earliest-deadline 

concave 

error 

Most-attained-time 

Least-slack-time 

linear 

error 

Least-utilization 

Rate-monotone 

convex 

error 

Least-attained-time 
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Scheduling  Type  C  Jobs 
( The  Simplest  Case  ) 

•  Jobs  in  J  has  the  same  period  p 

•  One  task  in  every  job  must  complete  in  Q  periods 
—  Cumulation  rate  =  Q 

•  After  the  tasks  in  M  are  assigned,  we  have 


0  p  2p  Zp  (Q— l)p 

■  ■  ■  *  I  ■ 


'  *.1 


Must  pack  K  pieces  of  lengths  in 

Q  bins  of  size  p  — 


Schedulability  Criterion 
{for  the  length  monotone  algorithms  ) 


A  set  of  jobs  with  repetition  period  p,  cumulation  rate  Q, 
total  utilization  factor  C/,  and  minimal  utilization  factor  u  is 
schedulable  if 


0  0.5  1.0  1.5  u 
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Scheduling  Aperiodic  Jobs 
That  Allow  Imprecise  Results 


•  Given  a  dependent  tasks  set  {  } 

—  Each  task  is  characterized  by 

2  ready  time 
0  deadline 
0  execution  time 

—  dependencies  between  the  tasks 

•  Each  task  is  decomposed  into 

—  a  mandatory  subtask 

—  an  optional  subtask 
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An  Example:  Scheduling  to  Minimize  Error 


0.0 

0.6 

0.4 

0.2 

0.2 

0.2 

0.7 

0.4 

0.1 

0.3 

0.4 

1.0 

0.5 

0.2 

0.3 

1.2 

1.5 

0.3 

0.1 

0.2 

0.6 

2.0 

0.8 

0.5 

0.3 
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Available  Building  Blocks  of  Imprecise  Computation  Systems 


•  Algorithms  for  scheduling  aperiodic  tasks  with  ready  times, 
deadlines  and  precedence  constraints  on  uniprocessors 

—  to  minimize  the  total  error 

—  to  minimize  the  maximum  error 

—  to  minimize  the  mean  flow  time 

—  to  minimize  the  number  of  discarded  optional  subtasks 

•  Algorithms  for  scheduling  aperiodic  independent  tasks  with 
ready  times  and  deadlines 

—  to  minimize  the  total  error  on  multiprogrammed 
multiprocessor  systems 

—  to  minimize  the  total  error  when  parallelized 

•  Algorithms  for  scheduling  periodic  tasks,  that  are 

—  error-noncumulative,  to  minimize  average  error 

—  error-cumulative,  to  meet  all  deadlines 

•  Task  assignment  algorithms  for  replicated  imprecise  tasks 

•  2-level  queuing  policies  for  optimal  tradeoff  between  average 
response  time  and  result  quality 

•  A  monotone  query  processor  for  relational  algebra  queries 
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Using  Imprecise  Computation  Technique 


•  to  increase  availability 

•  to  reduce  the  need  for  error  recovery 

•  to  simplify  recovery  actions 

•  to  support  semantical  approaches  to  reduction 
of  checkpoint  sizes 

•  to  reduce  the  overhead  of  replication 
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Problems  in  Different  Application  Domains 


Solve  /(Jc,O  =  0 

for  XI  X2  Xi-i  xi 

1 _ I _ _ 1— _ L 

at  t\  t2  ti-i  ti 


•  Error -noncumiilative: 

—  Type-1  problems: 

0  Xi  is  independent  of  jc/-2»  *  *  * 
o  Examples  include  signal  processing,  still-image 
transmissions,  database  queries,  etc. 

•  Error-cumulative: 

—  Type-2  problems; 

o  Xi  =  Xi-u  but  is  independent  of  Xi-2,  ^/-3»  *  ‘  * 
o  Examples  include  Newton-like  iterative  algorithms, 
traditional  feedback  control,  tracking,  etc. 

—  Type-3  problems: 

o  Xi  is  dependent  on  x,  _2,  *  *  * 
o  Examples  include  real-time  simulation  and 
non-linear  control 
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Types  of  Imprecision  Management  Policies 
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Results  of  Requirement  Imprecise  Management 

Imprecise  Computations  Specifications  Rules 
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Current  and  Future  Work 


•  Development  and  evaluation  of  the  needed  architectural 
elements  for  the  efficient  generation  and  correct  usage 
of  imprecise  results  for  fault  tolerance 

•  Development  of  useful  approximation  semantics  and 
monotone  algorithms  in  key  application  domains 

•  Development  of  imprecision  management  policies  for 
application  domains  including 

—  presentative  numerical  computations 

—  compressed  voice  and  video  data  transmissions 

—  signal  processing 

—  feedback  control,  feedforward  control,  declarative 
control,  etc. 

—  database  queries  and  updates 
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SUMMARY 


•  The  desired  characteristics  of  time-critical  systems 
is  PREDICTABILITY,  not  speed,  or  fairness,  etc.: 

—  Under  a  nonnal  load,  all  hard  real-time  tasks 
meet  their  timing  constraints. 

—  Under  overload  conditions,  failures  in  meeting 
timing  constraints  occur  in  predictable  manner 


•  We  need  application-directed  scheduling  and 

resource  management: 

—  Scheduling  mechanisms  should  allow  different 
policies. 

—  Resolution  of  resource  conflicts  can  be  under 
the  explicit  direction  of  the  applications. 

•  Applications  must  be  designed  so  that  either 

—  they  have  deterministic  behavior,  or 

—  they  allow  trade  off  between  time  and  result 
quality. 
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gurrent  Research  in  Deductive 
atabases 

•  Disjunctive  Logic  programs  and 
Deductive  databases 

•  Combining  deductive  databases 

•  Cooperative  Answers 

•  Null  values  in  deductive  databases 

•  View  updates  for  deductive  databases 


Deductive  databases  ^ 
Background 

A  deductive  database  is  a  function  free 
Horn  logic  program. 

(Definite)  Logic  Programs 

A  < —  Si , . . . ,  Bfi 


Example 

stealth  radar,  faint 
plane  radar,  clear 
radar 
clear 

Meaning 

{radar,  clear,  plane,  stealth} 
{radar,  clear,  plane}* 


. . .  Background 

Use  of  negation 

A  Bi, . . . ,  Bn,  not  Di, . . . ,  not  Dm 


Stratified  Logic  Programs 

stealth  ^  radar,  faint 
radar _ 

plane  ^  radar,  not  faint 


Normal  Logic  Programs 

stealth  ^  radar,  faint,  not  obstructed 
obstructed  ^  radar,  faint,  not  stealth 
lookdown  ^  obstructed 
lookdown  <—  stealthradar 
faint 


Summary  of 
Definite  Semantics 


Semantics 

Theory 

Reference 

Positive  Consequences 

Fixpoint 

Tp 

vEK76 

Model 

Least 

Model 

vEK76 

Procedure 

SLD 

ffill74 

Negation 

Theory 

CWA 

Rei78 

Rule 

NAF 

Cla78 

Procedure 

SLDNF 

Cla78 

Stratified  Programs 


Fixpoint 

Tp 

ABW88 

Model 

Standard 

ABW88 

Perfect 

Prz88 

Normal  Programs 


Fixpoint 

Well-Founded 

JOO 

VRS88b 

Model 

Mwf(P) 

VRS88b 

Procedure 

SLS 

Ross89b /Przy89b 

Fixpoint 

General  Well-Founded 

BLM89b 

Model 

BLM89b 

Procedure 

SLIS 

BLM89a 

Recursion  and  Bottom-Up 
Computation 

Top-Down  produces  answers  one  by  one. 
Bottom-  Up  produces  all  answers  at  once. 

Computational  engine 

Hierarchical  programs:  Relational  algebra. 

Recursive  programs:  Compute  least  fixpoint 
of  relational  equations. 


Optimization:  Magic  Sets  restrict  the 
computation  to  what  is  needed  for  the 
query. 


Disjunctive  Deductive 
Databases 

A  disjunctive  deductive  database  consists 
of  clauses  of  the  form: 

^1  V  •  •  •  V  Ajt  ^  Si, . . . ,  Bn,  not  Di, . . . ,  not  Dm 

Example 

stealth(A’)  V  obstructed(X)  radar(X),  faint (X) 

plane(A’)  <<—  radar(X),  not  faint(X) 

radar(3) 

radar(l)  V  radar(2) 
faint(l)  V  radar(2) 

Meaning: 

{radar(3),  radar(2),  plane(3),  plane(2)} 

{radar(3),  radar(l),  faint(l),  plane(3),  stealth(l)} 
{radar(3),  radar(l),  faint(l),  plaiie(3),  obstructed(l)} 

Queries  and  Answers 


Query 

Answers 

radar(X) 

radar(3) 

radar(l)  V  radar(2) 

faint(X)  V  plane(y) 

plane(3) 

faint(l)  V  plane(2) 

Disjunctive  Semantics 


Semantics 

Horn 

Disjimctive 

Theory 

Reference 

Theory 

Reference 

Positive  Consequences 


Fixpoint 

Tp 

vEK76 

T‘p 

MR90 

Model 

Least 

Model 

vEK76 

Min.  Model 

Min82 

Model-State 

LMR89 

Procedure 

SLD 

Hill74 

SLO 

LMR89 

Negation 


Theory 

CWA 

Rei78 

GCWA 

Min82,MR90 

WGCWA 

Ross87,  RLM87 

Rule 

NAF 

Cla78 

SN 

MR88 

NAFFD 

RLM87 

Procedure 

SLDNF 

Cla78 

SLONF 

MRL89 

Stratified  Programs 


Fixpoint 

Tp 

ABW88 

MR89 

5? 

Ross87,MR89 

Model 

Standard 

ABW88 

Stable  State 

MR89 

Perfect 

Prz88 

Perfect 

Prz88 

Normal  Programs 


Fixpoint 

Well-Foimded 

Strong/ Weak  Well-F  /Stationary 

JOO 

VRS88b 

1 

Model 

Mwf(P) 

VRS88b 

mmm 

Ross89a 

Mp 

Przy90 

Procedure 

SLS 

Ross89b/Przy89b 

Fixpoint 

General  Well-Foimded 

General  Disjunctive  Well-Founded 

BLM89b 

BLM90 

Model 

WMMM 

BLM89b 

BLM90 

Procedure 

SLIS 

BLM89a 

SLIS 

Prz90 
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Disjunctive  Bottom-Up 
Computation 

Problem  How  to  represent  the  minimal 
models? 

Minimal  models 

{radax(3),  radax(2),  plane(3),  plane(2)} 

{radar(3),  radar(l),  faint(l),  plane(3),  stealth(l)} 
{radar(3),  radar(l),  faint(l),  plane(3),  obstructed(l)} 


Model-tree 

e 

I 

radar(3) 


plane(3) 


radar(l)  radar(2) 

I  I 

faint(l)  plane(2) 


8tealth(l) 


ob8tnicted(l) 


Disjunctive  Bottom-Up 
Computat  ion 

Problem  How  to  represent  the  minimal 
models? 

Minimal  models 

{radar(3),  radar(2),  plane(3),  plane(2)} 

{radar(3),  radar(l),  faint(l),  plane(3),  stealth(l)} 
{radar(3),  radar(l),  faint(l),  plane(3),  obstructed(l)} 


State  Model 

Model-tree 

e 

radax(3), 

1 

radar(3) 

plane(3), 

1 

radax(l)  V  radar(2), 

plane(3) 

radax(l)  V  plane(2), 

faint(l)  V  radar(2), 

radax(l)  radar(2) 

1  1 

faint  (1)  V  plaiie(2), 

1  1 

stealth(l)  V  obstnicted(l)  V  radar(l), 

faint(l)  plane(2) 

stealth(l)  V  obstnicted(l)  V  plaiie(2)  } 

stealth(  1 )  obstructed(  1 ) 
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IJgjture  ^ork  on 
Disjunctive  Deductive 

Databases 


•  Algorithms  for  Stratified  and  Normal 
DDDB. 


•  Magic  Sets,  or  other  optimization 
techniques. 

•  Algorithms  for  updating  relations  and 
views  in  a  DDDB. 
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Combining  Databases 

Problem:  Given  a  set  IC  of  integrity 
constraints  and  databases  DBi  and  DB2 
each  one  consistent  with  IC  and  such  that 
DBi  U  DB2  is  inconsistent  wrt  IC, 

How  can  consistency  be  restored? 


Example: 


DBi  = 

DB2  = 


{a;  c} 


{o;6;c; 


a}\ 

—  b  A  d} 


d  < — 


c«i2 


Results  on  Combining 

Create  a  disjunctive  database: 

DB\  "1“  DB2  “  ■{  c?  ^ —  CL 

c 

aWb  } 

IC  =  {  Ad} 

DBi  +  DB2  is  maximaly  consistent  with 
respect  to  IC, 

All  minimal  models  satisfy  the  IC: 

{c,  a,  d} 

{c,b} 


The  Algorithm  combines  definite 
stratified  deductive  databases. 
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Future  ^ork  on 
Combining  Databases 

•  Algorithms  for  First  Order  theories  and 
Disjunctive  databases. 


•  Theories  with  defaults 

-  Auto-epistemic, 

-  Stable  Clases, 

“  etc. 


•  Prioritizing  information. 


•  Aplication  to  view  updates. 
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Abstract 

Many  widely  used  programs  contain  valuable  knowledge  of  algorithms,  processes  and  methods  which 
could  be  employed  usefully  in  expert  systems.  The  basic  notion  is  to  extract  such  knowledge  from 
reliable  programs,  translate  it  into  expert  system  rules  and  enter  the  generated  rules  into  an  expert 
system  knowledge  base.  These  programs  can  provide  a  reliable  and  consistent  set  of  rules  that  search 
a  problem  space  systematically  and  assure  termination  of  execution.  The  expert  system  can  then  be 
employed  to  provide  a  user  with  expertise  on  the  respective  algorithms,  procesMS  and  methods.  A  visual 
programming  tool  can  be  provided  to  employ  the  expert  system  in  testing  ct  the  programs. 

For  exanq>le,  a  program  to  solve  simultaneous  equations  is  used  in  a  wide  variety  of  systems,  e.g.,  for 
allocation  of  transportation  resources.  It  can  be  translated  into  a  set  of  rules  that  embody  the  solution 
method.  The  rules  can  then  be  loaded  into  the  knowledge  base  of  an  expert  system.  A  user  of  the  expert 
system  may  input  a  problem  and  have  the  expert  system  determine  the  scheduling  and  allocation  of 
resources,  as  well  as  “explain”  the  decisions  by  referencing  the  rules  that  it  has  used. 

In  this  manner,  it  wiQ  be  possible  to  autcnnatically  tiq>  an  immense  source  of  knowledge  embedded 
in  existing  and  future  programs  to  enrich  the  knowledge  base  of  expert  systems. 

The  translation  of  a  program  into  expert  system  rules  consists  first  of  translating  the  procedural 
language  program  into  an  equational  (functkuial)  language.  The  advantages  of  an  equational  language 
are  in  the  arbitrary  ordering  of  equations  and  no  side-effects  [Prywes, 90].  The  equations  are  translated, 
one  by  one,  into  respective  rule  definitions  used  by  an  e]q>ert  system.  The  execution  model  of  an 
equational  language  can  be  graphically  portrayed  by  a  data  flow  or  Petri-net  diagram.  It  is  similar  to 
the  Rete  algorithm  [Forgy,82]  u^  to  schedule  execution  (pattern  matching)  of  expert  system  rules. 

The  paper  examines  the  feasibility  of  this  approach.  Ihe  ^>proach  is  demonstrated  in  translating  a 
MODEL  equational  language  program  [SsymanskifcPrywes,88]  for  solving  linear  simultaneous  equations 
into  rules  for  the  CLIPS  expert  system  [Giarratano,89].  The  paper  then  explores  use  of  the  expert  systm 
and  visual  programming  to  test  the  rules  extracted  from  a  program. 


*  Supported  by  Contract  AFOSR-90-0335A  from  AFOSR 
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INTRODUCTION 


'he  underlying  notion  in  this  paper  is  that  there  exist  several  computer  programming  paradigms  —  each 
dth  its  own  advantages,  and  that  the  translation  among  these  paradigms  can  benefit  each  other.  The  paper 
roposes  combining  these  translations  with  a  powerful  graphics  facility  for  visual  programming  [Kim, 91]. 

?he  paper  addresses  especially  translations  among  three  programming  paradigms  and  their  respective 
inguages,  as  follows: 

(1) .  Procedural  Languages:  This  paradigm  is  based  on  the  computation  model  of  the  sequential 

computer  [Gelemter,90].  It  is  by  far  the  most  developed  and  popular  paradigm  with  extensive 
existing  libraries  of  programs. 

(2) .  Equationsd  Languages:  This  paradigm  is  based  on  stating  a  program  as  a  set  of  equations 

[Szymanski,91].  The  underlying  computation  model  is  a  Petri-net  graph.  Computation  proceeds 
as  soon  as  inputs  become  available.  This  paradigm  is  advantageous  (a)  where  a  program  reliability 
is  important  and  (b)  for  parallelizing  the  computation. 

(3) .  Rule-Based  Languages:  In  this  paradigm,  a  program  can  be  developed  progressively  accumu¬ 

lating  rules  in  a  knowledge  base.  The  computation  model  is  based  on  the  Rete  pattern  matching 
algorithm  [Forgy,82].  This  approach  is  advantageous  for  (a)  accumulating  rules  in  a  knowledge  base, 

(b)  prototyping,  and  (c)  testing  [Giarratano,89] 

The  benefits  of  translation  among  these  paradigms  are  as  follows:  The  main  interest  in  this  paper  is  in 
leveloping  a  methodology  for  rapidly  populating  the  rules  in  knowledge  bases.  There  are  numerous  useful 
dgorithms  programmed  in  procedural  languages  that  would  be  very  useful  in  expert  systems.  They  can 
)e  translated  into  sets  of  rules  that  can  be  readily  incorporated  in  knowledge  bases. 

i^ually  important  is  that  expert  systems  can  oflfer  a  superior  testing  environment  for  procedural  programs; 
>ecause  of  their  man-machine  interface  and  capability  to  “explain”  the  rules  used  in  a  computation. 

The  approach  proposed  in  this  paper  is  illustrated  in  Figure  1.  The  visual  programming,  translation, 
ind  testing  are  embedded  in  a  graphics  and  repository  enviroxunent  that  incorporates  visualization  and 
>perations  on  programs.  Programs  in  respective  paradigms  are  stored  in  the  repository.  They  can  be 
>perated  upon  to  perform  translations.  They  can  be  visualized  graphically  to  make  them  easier  to  be 
inderstood.  A  user  can  request  a  translation  from  one  paradigm  to  another  and  benefit  from  the  advantages 
>f  the  latter  paradigm. 

rhe  focus  in  this  paper  is  on  the  translation  between  Equational  and  Rule-Based  languages.  The  translation 
Tom  Procedural  to  Equational  languages  has  been  described  in  a  previous  report  [Prywes,90]. 

Because  of  limits  of  space  in  this  paper,  the  ^proach  is  presented  through  an  example.  The  Gauss 
Elimination  algorithm  is  used  to  solve  simultaneous  linear  equations.  Solution  of  simultaneous  equations 
s  widely  need^  in  expert  systems.  Translation  of  this  algorithm  to  a  rule-based  language  is  used  here  as 
m  example.  The  algorithm  is  described  in  Section  2. 

Section  3  describes  the  visualization  of  algorithms  in  the  equational  language,  MODEL  [Szymanski&Pr3nves,88] 
ind  the  graphic  operations  for  composition  and  understanding  of  algorithms. 

The  translation  from  the  MODEL  equational  language  into  the  CLIPS  rule-based  language  [Giarratano,89] 
s  addressed  in  Section  4. 

Section  5  discusses  the  use  of  visualization  and  the  CLIPS  expert  system  for  program  testing. 
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Figure  1:  Extracting  Expertise  from  Programs,  Converting  It  into 
Rules,  and  Accumulating  the  Rules  into  a  Knowledge  Base. 


The  paper  concludes  with  the  highlights  of  the  implementation  plan  for  a  protot3rpe  of  the  environment. 


2  THE  EXAMPLE 


The  function  of  the  Gauss  Elimination  algorithm,  used  as  an  example  in  this  paper,  is  illustrated  in  Figure 
2.  The  figure  illustrates  the  input  and  output  as  sets  of  simultaneous  linear  equations.  The  illustration 
of  testing  in  Section  5  will  utilize  the  shown  set  of  four  equations.  The  algorithm  manipulates  the  input 
matrix  of  coefficients  and  right  hand  side  constants  through  successive  matrices  until  it  obtains  the  matrix 
with  the  lower  left  hand  triangle  of  zeroes.  The  solution  for  the  variables  can  then  be  readily  obtained. 


3  THE  GRAPHICS  ENVIRONMENT  FOR  VISUALIZATION  C 
EQUATIONAL  LANGUAGE 

The  visualization  of  an  Equational  language  program  is  illustrated  in  this  section.  It  consists  of  visualizing 
separately: 

(1) .  Header:  a  context-like  diagram  of  the  algorithm  depicting  its  inputs  and  outputs  (Figure  3). 

(2) .  Data  structure:  depicting  the  tree-like  structure  of  data  with  special  icons  for  one,  two,  and  three 

dimensional  array  objects.  Arrays  are  the  basic  data  structure  in  the  MODEL  Equational  language 
(Figure  4). 

(3) .  Equations:  depicting  a  data  flow  or  Petri-net  like  gnq>h  of  equations  and  array  variables  called 

‘^array  graph”.  This  graph  portrays  the  dependendes  among  variables  and  equations.  The  subscript 
expressions  of  array  elements  are  also  shown  for  each  dependency  edge  (Figure  5  and  Figure  6). 

Each  of  this  parts  consists  of  a  pair  of  displays  —  graphics  and  textual.  The  two  displays  are  coordinated 
and  consistent.  Each  graph  is  called  a  *Suew”.  It  can  be  composed,  edited,  and  visualized  through  a 
"window”  using  the  DECdesign  methodology  [DEDdesign,90].  The  palette  at  the  bottom  of  each  window 
shows  the  icons  for  objects  in  the  griph.  A  drde  denotes  an  equation.  A  rectangle  denotes  an  array. 
Shadow  arrays  denote  control  structures  such  as  a  size  of  a  dimension. 

The  graphics  and  textual  views  are  checked  to  verify  the  conustency  among  them.  There  are  also  algorithms 
to  check  and  add  details  so  that  the  equation  can  be  folly  evaluated.  This  requires  through  statically 
checking  equations  and  data  chains  for  causality,  for  existence  of  variables  and  sizes  for  all  dimensions,  and 
for  data  type  compatibility. 

Figures  3-6  portray  the  graphics  and  texts  for  the  Gauss  Elimination  example.  The  figures  espedally 
show  how  the  input  matrix  (B-i(i,  j))  is  translated  into  a  series  of  matrices  that  form  a  cube  (a(i,  j  ,k)) 
with  assodated  arrays:  qCi.k)  denotes  the  non-zero  elements  and  p(k)  denotes  the  positions  of  pivoting 
elements.  Finally,  the  output  (m-o(i,j))  is  presented.  The  internal  consistency  and  completeness  of  the 
equations  fa.n  also  be  checked  visually  by  tradng  the  dependency  edges  between  variables,  equations,  and 
the  subscript  expressions  shown  for  each  edge. 
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Figure  2:  Illustration  of  Inputs  and  Outputs  to  Gauss  Elimination. 


Input  Linear  Equations 

niji(l,l)*xl  +  m_i(l,2)*x2  +  m_i(l,3)’*'x3  +  m_i(l,4)*x4  =  m_i(1.5) 
m_i(2.1)*xl  +  m_i(2.2)*x2  +  m_i(2.3)*x3  +  mj(2,4)*x4  =  m_i(2.5) 
ni_i(3,l)*xl  +  m_i(3,2)*x2  +  nij(3.3)*x3  +  m_i(3,4)*x4  =  m_i(3.5) 
ni-_i(4,l)*xl  +  m_i(4,2)'''x2  +  m_i(4,3)*x3  +  m_i(4,4)‘''x4  =  m_i(4,5) 

i 

Gauss  Elimination  Algorithm 

F 

Output  Linear  Equations 

m_o(l,l)’''xl  +  m_o(l,2)*x2  +  m_o(l,3)’''x3  +  m_o(l,4)*x4  =  m_o(l,5) 
m_o(2,2)*x2  +  m_o(2,3)*x3  +  m_o(2,4)*x4  =  m_o(2,5) 
m_o(3,3)*x3  +  in_o(3,4)*x4  =  m_o(3,5) 
m_o(4,4)*x4  =  m_o(4,5) 
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igure  3:  Header. 

FILE  .EDIT  VIEW  TOOLS  HELP 


HEADER: 


MODULE:  Gai]Ssian_EIiininatioa; 
SOURCE:  in_file; 

TARGET:  out_filc; 


I 


HBal 


Figure  4:  Data  Structures. 


a 

i 


■ 


DATA  DECLARATIONS: 

1  in.file  IS  FILE,  /*  input  data  file  */ 

2  in.size  IS  RECORD, 

3  m_size  IS  FIELD  (NUM),  f*  size  of  the  ii^)ut  matrix,  n  */ 
2m_rec(*)ISRECORD, 

3  IS  FIELD  (FLOAT  DEQ;  f*  input  matrix,  n  x  (n+1)  */ 

1  out_file  IS  FILE,  /*  ouqnit  data  file  ♦/ 

2out_rec(*)ISRECORD, 

3  m_o(*)  IS  FIELD  (FLOAT  DEC)); 

f*  output  matrix  is  upper-triangular  if  die  input  has  unique  solution, 
otherwise,  all  elements  of  the  matrix  are  zero.  */ 
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f^igure  6:  Equations  (Text). 


EQUATIONS: 

/*  Eql:  array  dements  */ 

a(ij.k) « IF  k=l  THEN  m_i(ij)  /*  load  input  matrix  */ 

ELSE  IF  p(k*l)sO  THEN  0  /*  no  pivoting  mean  no  unique  solution  */ 

ELSE  IF  i=p(k-l)  THEN  a(k-lj4c-l)  /*  switch  pivoting  row  p(k-l)with(k-l)  row  •/ 
ELSE  IF  i»(k-l)  THEN  aO)(k-l)JJc-l)  /*switch  (k-1)  row  with  pivoting  raw  p(k-l)  */ 
ELSE  IF  i<(k-l)  THEN  a(ijjt-l)  /*  no  need  to  pivot  */ 
ELSEa(ioJc-l>a(p(k-l)j>l)*a(Uc-l>l)/a(p(k-l)Jc.lJc-l); 

I*  zeroing  the  lower  left  triangle  */ 

I*  Eq2:  non*zero  array  dements  for  pivoting  */ 

q(U)  =  IF  i-1  THEN  IF  a(LkJc)»0 1  k>l  THEN  0  ELSE  1 

ELSE  IF  i<k  THEN  0  ELSE  IF  a(ijcjc)=0  THEN  q(i.l  Jc)  ELSE  1; 

/•  Eq3;  position  of  pivoting  dement  •/ 
p(k)  =  IF  k=l  &  i=l  &  q(Lk)=l  THEN  i 

ELSE  IF  i>l  &  k<=i  THEN  IF  q(i-l  Jc)=0  &  q(ijc)=l  THEN  i 

ELSE  IF  q(m_size  JcH)  THEN  0; 

/*  p(k)=0,  if  there  is  no  no>zero  dements  *! 

/*  Eq4:  output  matrix  ♦/ 

m_o(ij)  =  IF  k=m_size  THEN  a(ij3^); 

/*  Eq5:  size  of  subscript,  i  */ 

SIZE.in_rec  *  m_size; 

/*  Eq6:  size  of  subscript,  j  */ 

SIZEmJ  =  m_size+l; 

/*  Eq7:  size  of  subscript,  k*/ 

SIZEa  =  m_size; 


s 


igure  7:  Translation  of  a  MODEL  Equation  into  a  CLIPS  Rule. 


TRANSLATION  TO  CUPS  RULES: 

/*  Eql:  airay  elements  */ 

a(y  Jc)  =  IF  k=l  THEN  m_i(io)  I*  load  input  matrix  */ 

ELSE  IF  p(k-l)=0  THEN  0  /*  no  pivoting  elements  n»an  no  unique  solution  *! 

ELSE  IF  i=p(k-l)  THEN  a(k-l  jj:-!)  /*  switch  pivoting  row  p(k-l)  with  (k-1)  row  ♦/ 
ELSE  IF  i=(k-l)  THEN  a(p(k-l)  jjc-l)  /♦switch  (k-1)  row  with  pivoting  raw  p(k-l)*/ 
ELSE  IF  i<(k-l)  THEN  a(ijjc-l)  /*  no  need  to  pivot  ♦/ 
ELSEa(ioJc-l>a(p(k-l)J>l)*a(i>U-l)/a(p(k-l)Jc-U-l); 

/*  zeroing  the  lower  left  triangle  */ 


/♦  Rulel:  confuting  a  triangular  matrix,  a(iJJ:)  ♦/ 

(deftule  rulel 

(p  =(-  ?k  I)  ?p)  /♦  does p(k-I) exist? ifso,^ has  iisvalue*/ 

(a  K-  ?k  1)  ?j  =(-  ?k  1)  ?alg)  /*  ?akj  =  a(k-l j>l)  */ 

(a  ?p  ?j  Mr  ?k  1)  ?apj)  /►  ?apj  =  a(p(k-l)j>l)  */ 

(a  ?i  ?j  Mr  ?k  1)  ?aij)  /*  ?aij  =  a(ioJc-l)  */ 

(a  ?i  M-  ?k  1)=(-  ?k  1)  ?aik)  /*  ?aik=a(U-U-l)*/ 

(a  ?p  =(-  ?k  l)=(-?k  1)  ?apj)  /*  ?apk=a(p(k-l)jc-ljc-l)*/ 

=> 

(if  (=  ?p  0)  flien  (asswt  (a  ?i  ?j  ?k  0))  /*  a(ijA!:)=0,  ie.,  no  unique  solution  */ 

else  (if  (=  ?i  ?p)  then  (assert  (a  ?i  ?j  ?k  ?akj))  /*  the  pivoting  row  */ 
else  (if  (=  ?i  =(-  ?k  1))  then  (assert  (a  ?i  ?j  ?k  ?apj))  I*  die  (k-l)tii  row  */ 
else  (if  (<  ?i  M-  ?lc  1))  then  (assert  (a  ?i  ?j  ?k  ?aij))  /*  no  pivoting  */ 
else 

(assen  (a  ?i  ?j  ?k  =(-  ?aij  (*  ?apj  (/  ?aik  ?apk)))))))))) 

/♦  zeroing  the  lower  left  triangle  ♦/ 
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4  TRANSLATION  OF  EQUATIONS  INTO  RULES 


Each  equation  is  translated  into  a  respective  rule  and  added  to  the  knowledge  base.  Figure  7  shows  the 
translation  of  the  equation  that  defines  a(i,j,k)  into  the  respective  CLIPS  rule.  As  shown, .the  equa- 
tional  representation  is  more  compact,  however  it  conveys  the  same  information  and  same  computational 
interpretation. 

The  translation  is  based  on  the  fdlowing:  Computation  of  the  left  hand  side  variable  of  an  equation  requires 
pre-existence  of  the  right  hand  side  variables.  This  is  the  basis  for  the  condition  part  of  the  rule;  namely 
that  the  right  hand  side  variables  exist.  The  assertion  part  of  the  rule  is  simUar  to  the  equation  and  it 
asserts  a  value  for  the  left  hand  side  variable.  As  elements  of  array  are  asserted  as  facts  in  a  knowledge 
base,  they  are  recognized  as  existing  and  therefore  satisfying  the  condition  part  of  some  rules  which  can 
then  be  interpreted  next,  and  so  on.  The  causality  check,  previously  mentioned,  assures  completion  of  the 
overall  computation. 


5  VISUAL  PROGRAM  TESTING  USING  AN  EXPERT  SYSTEM 


Program  testing  is  viewed  as  essentially  a  test  of  prc^am  branches  and  the  synthesis  of  these  tests.  A  key 
consideration  is  the  choice  of  test  data  [DeMillo,87,  Howden,87,  Hamlet, 88]. 

Use  of  an  expert  system  facilitates  such  a  process  through  the  following  features: 

(1) .  A  user  can  assert  the  test  data  for  any  of  the  variables  in  the  program. 

(2) .  Once  these  variables  exist,  the  expert  system  may  be  “fired”  to  execute  “branches”  (equation  chains) 

of  the  program  that  depend  on  these  variables.  This  may  continue  until  a  user  specified  breakpoint 
(stopping  point),  which  limits  the  computation,  is  reached. 

(3) .  The  expert  system  together  with  the  graphics  system  display  the  values  of  computed  variables,  which 

are  shown  for  the  last  values  of  the  indices,  and  “explain”  the  sequences  of  the  fired  lules  to  obtain 
those  values. 

To  support  this  process,  the  testing  environment  has  a  number  of  operations  that  can  be  selected  by  the 
user  from  menus.  The  breakpoint  delimiting  the  computation  is  specified  by  the  user  pointing  with  the 
mouse  to  selected  equations  or  by  key-in  of  delimiting  index  values. 

The  choice  of  operations  in  menus  is  shown  in  Figure  8.  The  exercising  of  two  operations  for  the  Gauss 
Elimination  example  is  illustrated  in  Figure  9  and  Figure  10,  respectively.  Figure  9  illustrates  not  delimiting 
the  testing  and  allowing  it  to  “fire”  from  the  asserted  input  to  the  completed  output.  It  shows  asserting  the 
input  (iaJ.(i,J))  with  the  results  in  eventually  evaluating  BL.o(i,j).  Figure  10  iUustrates  use  of  breakpoint 
delimit  the  computation  to  k>2,  namely  computing  up  to  the  second  (i.j)  matrix  of  the  three  dimensional 
array,  a(i,j,k),  i.e.,  a(i,j,2). 

6  IMPLEMENTATION  APPROACH 


The  key  features  of  a  prototype  system  under  development  are  depicted  in  Figure  11. 
As  already  noted,  the  chcMce  of  languages  is: 
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figure  8:  Menus  and  Submenus 


PROMPTING 


CHECKING 


figure  11:  Implementation. 

mplemantation  Approach 

languages: 

Procedural  Language:  FORTRAN 
Equadonal  Language:  MODEL 
Rule-Based  Language:  CLIPS 

jraphics: 

DECdesign  -  Graphics  Environment  for  programming  and  design. 
Methodology  Implementation  Facility  (MDF)  -  Program  generation 

for  graphics  methodologies. 
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1) .  FORTRAN  for  the  Procedural  language. 

2) .  MODEL  for  the  Equational  language. 

3) .  CLIPS  for  the  Rule-Based  language. 

tie  translation  from  FORTRAN  to  MODEL  was  described  in  [Prywes,90].  The  environment,  and  this 
iper,  focus  on  the  latter  two  classes  of  languages.  The  choice  of  MODEL  is  primarily  due  to  existence  of 
compiler /procedural-program-generator  for  MODEL  [Prywes&Pnueli,83,  Szymanski&Prywes,88]  as  well 
the  above  mentioned  checking  algorithms.  The  choice  of  CLIPS  is  primarily  due  to  existence  of  an 
Gdent  language  Interpreter  written  in  C  with  a  high  degree  of  portability  [GiaiTatano,89]. 

he  DECdesign  graphics  environment  was  selected  primarily  due  to  existence  of  Methodology  Implementa- 
3n  Facility  (MIF).  This  is  a  program  generator  that  accepts  as  input  definition  of  a  graphic  methodology 
Dntity-Relation  graph  structures,  icons,  menus,  and  routines).  It  generates  programs  incorporated  in 
ECdesign  graphics  system  which  implement  the  defined  methodology.  Use  of  this  graphics  approach  was 
iscribed  in  [Prywes,91]. 
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Definitions 


We  make  some  definitions 

for  the  purpose  of  this  exposition. 

1.  knowledge: 

a.  definition  of  concepts  and  their  attributes 

aircraft:  vehicle 

speed:  100  mph  —  1600  mph  provides 

default 

transport  aircraft:  aircraft 

speed:  400  mph  —  700  mph 

•  •  • 

trip:  .  .  . 

•  •  • 

begin:  .  .  . 

•  •  • 

b.  rules  or  constraints  relating  concepts 

trip.begin(aircraft):  airport 

•  •  • 

2.  knowledge  base 

a.  A  collection  of  definitions  and  rules 

for  some  domain,  using  these  concepts 

b.  A  database  of  instances  for  that  domain 
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Problem  Statement 

Knowledge  pertains  to  specific 
application  domains 
transport,  manufacturing,  .  .  . 
and  subdomains 

passenger  transport,  freight  transport,  . 
and  sub-subdomains 

military  .  corporate  ,  ,,  commercial  . 
with  distinct,  possibly  overlapping  sets  of 
concepts: 
aircraft,  load, 

and  instances:  B747,  C141,  C17,  F16,  .  .  . 

so  that  rules  for  one  domain  cannot 

be  assumed  to  hold  in  another  domain 


Partial  Solutions 

1.  Define  all  terms  globally 

i.e.,  establish  standards 
Problems:  affects  all  rules,  un-natural, 
greatly  increases  number  of  terms 

2.  Delimit  all  rules  by  domain  predicates 

i.e.,  limit  applicability  of  rules 
Problems:  weakens  rules,  slows  processing 

3.  Keep  knowledge-bases  distinct  and 

fuse  their  resuilts 

i.e.,  permit  heterogeneity 
Problems:  most  decisions  require 

information  from  multiple  domains 
creates  uncertainty  due  to  doamin  mismatch 
(result  [not  domain]  scope, 
abstraction  level) 


We  focus  now  on  the  third  alternative 
(not  to  be  used  exclusively) 
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Heterogenous  KBs 

Advantages: 

1.  coherent  maintenance  by  specialist (s) 

2.  manageable  size  for  processing 

3.  parallel  processing  is  natural 

4.  effective  linkages  to  one  or  few 

databases 

sensor-based  sources 
simulations 


Disadvantages: 

1.  results  must  be  merged/fused 

by  higher  level  knowledge  processing 

2.  uncertainty  will  be  created 

when  fusing 
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the  past 
the  present 
the  future 
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Paradigm 

Participating  modules  provide 


Information  Services 


not  direct  knowledge  or  data 

Are  independent  contractors, 
not  integrated  subsystems 

These  notions  depend  on 

1.  specialized  processors, 

associated  with  knowledge  maintainers 

2.  effective  networks 

3.  catalog  nodes  describing  the  services 

versus  knowledge  integration, 
database  integration 
integration  of 

databases, 

sensor-based  processing 
simulations 
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Mediators 


axe  knowledge-based  processes 

1.  which  understand 

lower  level  service  providers 

2.  can  formulate  information  requests 

so  that 

useful  and 

mergable  information  is  returned 

3.  can  schedule  good*  execution  sequences 

(depends  on  having  bound  arguments 
and  performance  estimates) 

4.  and  report  combined  results 

with  rehabUity  assessments 
to  the  end-users’  appUcations. 

*optimal  requires  global  knowledge, 
partitioning  induces  suboptimality 
but  improves  computability  of  local  optima 

(large  systems  — ►  heuristics  also 

suboptimal)  — 
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Why  mediators? 

End-user  applications  must  focus  on 
decision  maJang  and  planning 
not  on  management  of  information  resources 


Follows  management  principles, 

where  the  decision-maker  relies  on 
specialists,  who  use 
base  resources,  as 
databases 
current  findings 
projections  from  what-if  taisks 


recall: 

Between  the  minds  that  plan  and  the  hands  that  build 
there  must  he  a  mediator 
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Architecture 


Knowledge-based  modules: 


Mediators 


between 

User  Applications  (private  programs) 
and 

Autonomous  information  resources  (public) 


All  Mediator  modules  are  combinations  of 

PEOPLE  and  MACHINES 
Arranged  top-down: 


result  ^  decision  making 

1  Independent  APPLICATIONS  on  workstations 

network  services  to  information  servers 

2  Multiple  MEDIATORS 

network  services  to  data  servers 

3  Multiple  INFORMATION  SOURCES,  ETC. 

input  ^  real-world  changes,  effects 
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Support  modules 

Revisting  the  resource  layer 

All  modules  use  distinct  data 
if  shared  database, 

synchronized  through  database  locking 
All  modules  use  distinct  knowledge 
if  shared,  have  local  working  copy 
Many  modules  use  diflFerent  reasoning 

depends  on  type  of  resources,  knowledge 
Up  to  now  we  have  impHed  that 

support  modules  use  AI  paradigms 
but  this  architecture  does  not  require  that 

•  knowledge  is  also  encoded  in 
o  algorithmic  programs 
fine  if  rules  are  stable 
o  OR  programs 

great  for  dealing  with  large  number  of  variables 
o  databases 

use  efficient,  simple  inference  techniques 
for  large  volumes 
o  spreadsheets 
for  balancing  projections 

as  long  as  their  interfaces  are  machine-friendly 
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Mediator  modules 


A  mediation  module  should  be  small  and  simple 
(as  a  any  software  module) 

Maintained  by  one  expert 

or  coherent  group  of  experts, 
assisted  by  monitors  on  the  data. 

Mediators  are  inspectable  by  the  potential  users 
for  selection,  evaluation 


Results 


Abstractions 
constraints  filters 

selection  criteria  deduction 

closure  for  completeness  i 

monitors  for  correctness  t  for  expert 

-  maintenance 

Source  Information 


F-12 


H-KB-12 


Distribution  of  Mediators 


Homebase:  at  expert 
Active  nodes:  arbitrary 
independent  copies 
bound  to  workstation 
bound  to  information  sources 


In  general,  independent  of  sources: 

1  The  mediator  contains  knowledge 

beyond  the  scope  of  the  sources. 

2  Often  must  deal  with  uncertainty 

about  sources 

3  Mediators  often  access  multiple  sources 

to  perform  appropriate  fusion 

In  general,  independent  of  workstation: 

1  too  large,  too  many 

2  Intrusive  maintenance 
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Query  J,  Relevant  information  t  Inspection  i 


Mediatory 

Formatted 
queries  i 


Expert 
Maintainer 

Bulky  I  Triggered 

responses  events  t 


Database  Sensor  Simulation 

Information  Information  Information 


All  modules  are  distributed 

over  nationwide  networks. 
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Alternate  AI  Paradigms 

This  architecture  separates  also 
two  competing  AI  paradigms 

•  Pragmatic  systems  for  the  end-user 

that  assist  in  decision-making 
and  must  face  the  uncertain  future 

but 

that  are  difficult  to  scale  up 

•  Formal  systems  for  the  mediators 

that  behave  predictably 
are  subject  to  optimization 
but 

that  can  only  deal  in  bounded  scopes 

you  can  never  formally  predict  the  future 

in  an  open  world 
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Research  Issues 


The  Interface  to  Mediators  requires 

a.  A  new  language  level  beyond  SQL,  4GLs 

(Why  a  language? 

Breadth  in  accessing  resorces 
Flexibility  in  composition, 

Rearrangement  for  efficiency 
Modularity  for  reallocation  to 
computing  nodes,  ...  ) 

b.  Machine-readable  descriptions 

of  needed  paarameters 
candidate  results 

c.  Uncertainty  computations 

caused  by  domain  mismatch 

d.  Dealing  with  performance  issues: 

Obtaining  estimates  (mean,sd)  of 
costs,  result  cardinalities 
Caching 
Binding 
Compiling 
Reorganization 
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The  Problem 


Support  the  retrieval  of  information  in  com¬ 
plex  domains,  which  require  the  integration  of 
multiple  databases. 

One  must: 

•  Know  what  databases  are  available 

•  Speak  the  language  of  each  individual 
database 

•  Represent  the  task  in  terms  of  the  data¬ 
bases 

•  Find  a  combination  of  queries  that  corre¬ 
spond  to  the  task 

•  Obtain  some  degree  of  “optimality" 

•  Match  the  outputs  of  one  query  with  the 
requirements  of  others 

•  Interpret  the  results  in  terms  of  the  origi¬ 
nal  task. 


SIMS’  Solution:  “Soft”  Integration 

SIMS  is  a  broker  which  coordinates  access  to 
the  databases.  Important  features  of  SIMS 
are: 

•  Database-independent  interaction  ian- 
guage:  the  user  does  not  need  to  be  fa¬ 
miliar  with  the  databases  or  their  organi¬ 
zation. 

•  Logical  integration:  the  databases  re¬ 
main  separate  and  are  not  reprogrammed 
into  a  custom  system. 

•  Dynamic  integration  via  planning:  the 
combination  of  queries  that  perform  a 
task  is  determined  when  the  task  is  pro¬ 
posed  to  the  system. 

•  Models  of  databases  and  the  application 
domain  are  consulted  by  the  SIMS  plan¬ 
ner. 

•  Reformulation  may  be  performed  on  task 
statements. 


Current  SIMS  Status 


SIMS  now: 

•  Operates  in  the  Navy  briefing  domain  and 
in  transportation  planning  domain. 

•  Models  four  Oracle  database  tables,  and 
seven  primitive  databases. 

•  Accesses  databases  automatically  over  a 
local  area  network. 

•  Uses  domain  model  information  to  correct 
user  errors:  for  example,  upon  encounter¬ 
ing  an  inappropriate  term,  user  is  offered 
a  list  of  semantically  acceptable  alterna¬ 
tives. 


Reformulation  Component 
Goal  Repair  strategies: 


•  Involve  replacing  elements  of  the  goal 
specification  until  a  correct  one  is  arrived 
at. 

•  Are  based  on  structure  of  domain  and  goal 
knowledge. 


*  Ships,  Area  =  South  China  Sea 

*  Area  :  Employment  Area,  or  Location? 


Current  SIMS  Status,  cont. 


Uses  domain  semantic  constraints  to  op¬ 
timize  access:  for  example,  LOOM  num¬ 
ber  restrictions  affect  use  of  looping  in  re¬ 
trieval. 


Uses  domain  model  structure  to  reformu¬ 
late  goals  as  equivalent  conjunction  or  dis¬ 
junction  when  necessary. 


Reformulation  Component 


Some  Goal  Replacement  strategies: 


•  Conjunction:  A  combination  of  goals 
provide  for  the  original  one. 

Ship  status  =» 

Location,  course,  employment  schedule 

•  Specification:  A  set  of  goals  requiring 
additional  information  covers  the  original 
one. 

Ship  location 

Ship  location  by  country 

•  Subsumption:  A  super-ordinate  goal 
subsumes  the  original  one. 

Ship  displacement  ^ 


Displacement  by  class 


Sample  Query 


(:AND  (CURRENT-MAJOR-EMPLOYMENT  TShip  "OPS") 
(CLASS  ?Sliip  ?Class) 

(OVERALL-READINESS  ?Sliip  5)) 
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raatrtetten  of  8  on  tho  rolo  ICIRDV 


with  •  v«lu«  r««trietion  of  8  on  tho  rolo  ICIROV 


Hierarchical  Query  Reformulation 


Given:  A  query  to  a  database  expressed  in  a 
high-levei  KR  language. 

Find:  A  formulation  of  the  query  that  can  be 
implemented  efficiently. 

Approach:  Exploit  the  hierarchical  structure 
of  the  search  space  to  guide  reformulation. 
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Example 


Query: 

List  the  Soviet  ships  with  a  capacity  greater 
than  40,000  tons. 


(and  (Capacity  >  40,000)  • 

(shiptype  =  ship) 
(Registry  =  Soviet) 
(Shipname  =  ?)) 


Knowiedge  Base: 

Soviet  Ships 


Tankers  |  Frigates 

/  Cargo  Ships  \ 

/ 

/Capacity  max  \capacity 

capacity  \ 

62,600  ’  3,800 

13,900 
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Databases: 


Owner  Database  ( on-line ) 
Shlpname  Oimer  Shlptype  Registry 


Capacity  Database  (batch) 


The  Search  Process 


Query  Processing: 

1.  Select  a  formulation  of  the  query. 

2.  Select  the  databases  and  access  paths. 

3.  Estimate  the  cost  of  executing  this  query 
for  the  selected  databases  and  paths. 

4.  If  the  estimated  cost  is  too  high,  search 
for: 

(a)  Alternative  implementations. 

(b)  Alternative  databases  and  access 
paths. 

(c)  Reformulations  of  the  query. 
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Reformulation  Example 


•  Capacity  database  is  batch,  so  processing 
the  original  query  will  take  several  hours. 

•  The  system  decides  to  spend  a  few  min¬ 
utes  attempting  to  reformulate  the  query 
to  eliminate  the  capacity  constraint. 

•  KB  model  yields: 
capacity  >  40,000  — >  tanker 

•  Capacity  constraint  can  be  dropped  from 
query. 

(and  (shiptype  =  tanker) 

(registry  =  Soviet) 

(shipname  =  ?)) 
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Query  Reformulation 


Problem:  Given  a  query  in  a  high-level  lan¬ 
guage,  find  equivalent  formulations  of  the 
query. 

Approach: 

•  Add,  delete,  and  replace  constraints  in  a 
query. 

•  Knowledge  base  makes  it  possible  to  per¬ 
form  complex  reformulations  on  a  query. 

•  Search  is  directed  by  implementation  at 
lower  levels. 

•  Similar  to  work  by  King,  1981,  but  not 
limited  to  semantic  integrity  constraints. 
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Database  and  Access  Path  Selection 


Problem:  Given  a  high-level  database  query, 
find  a  set  of  databases  and  corresponding  ac¬ 
cess  paths  that  can  be  used  to  answer  the 
query. 

Approach: 

•  Search  through  the  models  of  the 
databases  to  select  the  databases  and  ac¬ 
cess  paths. 

•  Databases  may  be  overlapping  and  con¬ 
tain  multiple  access  paths. 

•  Search  is  directed  by  the  implementation 
costs. 

•  Allows  flexible  access  in  contrast  to  oth¬ 
er  systems,  which  either  assume  a  sin¬ 
gle,  fixed  access  path  or  expect  the  access 
path  to  be  given  in  the  query  (as  in  SQL). 
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Query  Implementation 


Problem:  Given  a  query  and  the  selected 
databases  and  access  paths,  find  the  low-level 
database  operations  that  can  be  used  to  im¬ 
plement  the  query. 

Approach: 

•  This  level  corresponds  to  implementing  an 
SQL  query. 

•  Requires  determining  the  Select,  Project, 
Join,  Scan,  and  Sort  operations  and  find¬ 
ing  an  efficient  ordering  to  implement  a 
query. 

•  Problem  has  been  studied  extensively  in 
the  database  field,  so  we  will  use  existing 
techniques. 
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Summary 

•  Use  the  knowledge  base  to  reformulate 
the  database  queries. 

•  Consider  alternative  databases  and  access 
paths  for  extracting  the  desired  informa¬ 
tion. 

•  Exploit  the  hierarchical  structure  of  the 
problem  to  guide  the  search  for  better 
queries. 

•  Use  existing  query  implementation  tech¬ 
niques  to  determine  when  to  search  for 
a  reformulation  and  how  much  time  to 
spend  expend. 
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APPENDIX  H 


APPENDIX  H 


Causal  Process  Modeling 

B.  Chandrasekaran 


LAIR,  Ohio  State  University 

Laboratory  tar  AIRMearch,  IhaOhioStatoUniveratty 


Functional  Representation  of  Devices 

Hypothesis:  For  the  tasks  of  design  &  diagnosis,  devices  should  be 
represented  by  explicit  indicators  of  how  its  furtctlona  arise  as  a  result  of 
causal  aeauences  fbehavlor)  made  bv  the  functions  of  the  components 
&  their  relations.  What  role  first  principle  (physical  laws)  knowledge  plays 
in  explaining  causal  sequences  should  be  part  of  this  structure. 

Laboratory  tar  AIRoMarch,  Iha  Ohio  Stato  Ltnhranliy 
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CPD  is  A  Causal  Story 


1 .  Causal  Pfocass  Dascriptions  and  Functional  Raprasantations  of  Davice 

*  (Partial)  Stata  of  a  physical  system: 

Predicatas  ovar  state  variablas  of  interest 

*  A  Causal  Process  Description  (CPD)  or  a  Causal  Story 

is  a  directed  graph  of  partial  states  of  a  system,  where  the  initial  state 
is  a  predicate  over  an  ’exogenous*  state  variable,  and  the  following  states 
are  causal  consequences  of  interest 
HOW  THINGS  WORK 


LaboraBry  tor  Al  RMMrch,  Th*  Ohto  Slato  Univaraiy 


CPD  Reflects  Agent's  Explanatory  Goals 

*  The  CPD  has  to  be  obey  certain 
rules  of  composition,  that  is, 
there  is  a  story  grammar. 

*  CPD  is  an  agent-epecinc  description 
of  a  process,  not  agent-independent, 
abstract  representation  of  the  process. 

That  is,  the  choice  of  the  state 
variables  in  the  process  decriplion 
reflects  the  agent’s  explanatory  goals. 


LSbowiory  tor  Al  nMiirch,  Tlw  Olito  Saw  LMvaraitjr 
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Annotations  of  CPD  Links:  1.  Process  Details 

CPD's  links  ars  ANNOTATED  using  foltowing  taxonomy  of  links: 
*  1.  By  <CPD> 

—  This  is  a  pointer  to  anothar  causal  process  description 
that  is  already  present  as  part  of  prior  knowledge.. 

o - >  o 

A  By  CPD-1  B 


CPD-1  :  0 - >0 - 0 . — >  o 

A  at  a2  B 


Example  usage:  Patil's  work  on  Abel 

Helps  us  to  reuse  CPD's. 

Hierarchicai  ways  of  organizing  processes. 

Uboniory  for  Al  noiewch,  Itw  Ohio  Staii  Univoroi^ 


Annotations  of  CPD  Links:  2.  Component  Function 

By  <function>  of  <oomponent> 

~  If  the  physical  system  can  be  decomposed  into  subsystems 
(components)  and  the  components  can  be  desabed  in  terms  of 
input/output  relations  called  Functions  On  the  sense  of  ’roles”, 
not  necesarily  ’intentions’),  then,  a  causal  transition  at  the  device  level 
can  be  explained  by  pointing  to  the  function  of  a  component. 

Helps  us  to  understand  system  behaviors  as  a  hierarchical 
composition  of  subsystem  behavtors.  The  component  function 
in  turn  may  be  explained  as  a  CPD  at  the  time  of  component 
modeling. 


Ubotaiory  for  Al  ntHTCh.  Ih*  Ohio  Staw  Unkrarrity 
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Annotations  of  CPD  Links:  3.  First  Principles 

*3.  As-p«r  <domain-law> 

-  Some  causal  transitions  are  simply 
explained  by  pointing  to  a  physical  law, 
e.g.,  Ohm's  law  or  F  -  ma,  etc.,  as 
instantiated  to  the  current  configuration. 

{Voltage  >  5  at  terminals  LI  and  L2}  - 

{Current «  1  amp  thru  W1} 

(Plffi»  where  Math  Models 
&  numerical  calculations  can  come  in) 


Labofanry  for  Al  Retearch,  Tha  Ohio  State  Univarthy 


Annotations  of  CPD  Links:  "By-Abstraction"  Link 

The  next  two  are  not  causal  transitions,  but  are  often  part  of  causal  process  descriptions. 
*  4.  By  <abstraction> 

This  transition  enables  the  agent  to  change  levels  of  abstraction  in  the  CPD. 

For  example,  to  explain  toe  state  calied  ''amplificalion  is  10’  from  toe  level 
of  analysis  in  terms  of  currents  and  voltages,  an  appropriate  abstraction 
operator  is  defined. 

We  have  a  number  of  types  of  abstraction; 
state  abstraction,  process  abstraction 
(e.g.,  a  repeated  transition  between  two  values 
may  be  abstracted  as  oscillation  of  certain  type.) 


UbcciMry  for  Al  ft— ■erch.  The  Ohio  Stw  Univtriity 
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Example  of  "By-Abstraction"  Link 


Laboratory  for  Al  RMMvch,  Ihe  Ohio  Stata  Uitivaraily 


Example  of  Asbstraction  of  Process  Into  State 


Laboratory  for  AIRaaaardi,  Tha  Ohio  Staia  Univoraiiy 
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Example  of  ”By-inference”  Link 


*5.  By-inf«r«nc« 

Many  CPO's  have  transitions  which  are  state  descriptbns  arrived  at  by 
inference  from  previous  state  descriptions,  andgenerai  domain  knowiedge. 

Thus  a  CPO  is  a  comprehensible  story 

about  how  the  causal  conclusion  is  drawn 

about  a  relevant  behavbr  of  the  system  as  a 

whob  from  knowiedge  of  the  behavbrs  of  the 

parts,  their  connections,  and  domain  laws. 

Laboratory  tor  At  Resoareh,  Tbo  Ohio  State  Univarahy 


FUNCTIONAL  REPRESENTATIONS  (FR) 

A  Family  of  Representations  Called  Functbnal 
Representatbns  have  Been  Designed  and  Complex 
Bbbgbxal  artd  Engineering  Systems  have  been 
Represented  at  our  Lab.  Given  Observatbns  to  be 
explained,  such  representatbns  have  been  used  to 
do  Diagnostb  Reasoning. 

Laboratory  tor  At  naeaarch.  The  Ohio  StataUntoarahy 
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Device  Representation  as  a  Causal  Process 

Device  Representation  (Functtonal  Representation) 

We  understand  devioa  functioning  as  a  causal 
process.  HOW  A  DEVICE  WORKS 
*  Function:  Pred(output  state  variables) 

Makes  <function> 

H  <exogenous  state  variables  -  .> 

Provided  <certain  background  conditbns  on  structure> 

*  By<CPD> 

*  CPD  description 

*  Structure:  Component  names  and  associated  function  names, 

and  connectbns  between  components 

Laboratory  for  Al  Raaoareh,  The  Ohio  State  Univoraity 


Device  Representation  as  Hierarchies  of  CPD'S 

Since  CPD  descriptbns  use  functions  of 
components  for  parts  of  their  annotatbns, 
how  a  device  works  is  represented  as 
hbrarchbal  oompositbn  of  CPp’s,  each  of ' 
which  point  to  other  CPD's  and  to  domain 
laws  for  causal  explanatbn  and  to  state 
and  process  abstractbns. 


Laboratory  for  Al  RMoarch,  iho  Ohfo  Slato  Urtlvaniiy 
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Lamp  Circuit 


structure 

S«rialiy*Connactad  {Switch,  battery.  Lamp,  w1,  w2,  w3} 
Switch  Functions: 

1.  Ilaka  (cloaa  connection)  if  (on) 

2.  Maka  (opan^onnaction)  If  (off) 
Battary:  Function:  Maka  (voitaga  at  terminals) 

If  (connected  to  terminals) 

Lamp: 


LaboruorylbrAIRMaarch,  'Rw  Ohio  StaM  Univarahy 


Causal  Account  of  How  Lamp  Circuit  Works 


Ubwaiwy  tor  Al  naiaoreh.  Tho  Ohto  StM  Unkwahy 
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Properties  of  FR 


•  Componsnt-Subcomponent  structure  representation 

can  be  repeated  several  layers 

•  *No-f unctlori'ln-structurtf'  obeyed  for  each  level 

•  Behavioral  spec's  of  components  not  mentioned, 

only  functions.  Parts  functionally  equivalent 
but  working  differently  can  be  substituted. 

Labontory  for  Al  ReMarch,  Tha  Ohio  Saua  Univertity 


Functional  Representations  and  Diagnosis 


taborattry  for  Al  Roiaarch,  Tha  Ohto  Staia  Univaraity 
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Work  at  LAIR: 


*  Stickian,  1987.  Compilation  of  Diagnostic  Knowledge. 
Given  a  functional  model  of  a  device,  malfunctions  and 
states  indicative  of  maKunctins  can  be  derived. 

Because  of  the  hierarchical,  system  subsytem- 
nature  of  CPD  representation,  we  get  a 
hierarchical  malfunction  representation. 

Sticklen  has  gone  on  to  develop  a  family  of  models 
of  parametric  simulationalgorithms.  and  has  applied 
to  ecological  and  materials  modeiirtg. 


Laboratory  for  AIRMMTCh.  Iba  Ohfo  Stala  Unlvaniiy 


Work  at  LAIR  (conTd) 

*  Kauneke,  1989.  Use  of  CFO's  to  generate  explanations 
of  diagnoses  by  demostrating  a  causal  story. 

*  Goal  1989.  Use  of  CPD  and  functional  primitives  to 
index  design  cases  in  memory  (why  a  design  works), 
to  retrieve  relevant  cases  from  memory,  and  to  identify 
possUe  substructures  to  modify. 

*  Allatnang  1990.  Represent  algorithms  as  generic  devices, 
with  altemativecode^el  implemerTtations. 

Used  for  program  debugging. 


Laboratory  for  At  naiiarch,  Tba  Ottto  StUa  Unfoaraiiy 
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Work  at  LAIR  (cont*d) 


*  Dardon,  89, 90.  (Phibsophar  of  sdenoe) 
Repraserrtation  of  scientific  theories  in  biology 
and  tracing  debugging  of  theories. 


LaboniioryferAIRetMrch,  Tb*  Ohto  Stale  Univonity 


Open  Issues 


*  Semantics  of  the  primitives 

*  Automatic  generation  of  CPD's  and  functbnal 
representations  from  structure.  Involves  recognitbn 
of  functbns  by  using  functbnal  templates. 

*  Applbatbn  to  more  oompbx  devices 

*  Grounding  of  states  in  a  general  theory 
of  substances  (Goel)  and  visual 
representatbns  (BC  and  Narayanan). 


LaboiBloryferAIRaaaarc>),  Tba  OTito  Stale  IMVenlty 
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COOPERATIVE  ANSWERING:  THEORY 

AND  PRACTICE 


Terry  Gaasterland^ 
Jack  Minker^’^ 


Department  of  Computer  Science^ 

and 

Institute  for  Advanced  Computer  Studies^’^ 
University  of  Maryland 
College  Park,  Maryland  20742 


OVERVIEW  OF  COOPERATIVE  ANSWERING 

METHODS 


•  PRINCIPLE  OF  COOPERATION 

•  EARLY  APPROACHES 

-  AI  APPROACHES 

-  ADDRESSING  USER  GOALS 

-  RELATIONAL  DATABASE  APPROACHES 

-  DEDUCTIVE  DATABASE  APPROACHES 

•  AN  INTEGRATED  COOPERATIVE  ANSWERING  SYSTEM 

•  SUMMARY 
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(Grice  1975); 


A  COOPERATIVE  ANSWER  IS  A 

•  CORRECT 

•  USEFUL 

•  NON-MISLEADING 


ANSWER  TO  A  QUERY. 


THE  PRINCIPLE  OF  COOPERATION 


EXAMPLE: 


QUERY: 

“In  Spring  1990,  how  many  international  flights  arrived  on  time  in 
Cedar  Rapids,  Iowa?” 


RESPONSE: 

“None.” 


COOPERATIVE  RESPONSE: 

“None,  there  are  no  international  flights  into  Cedar  Rapids,  Iowa.” 
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EARLY  APPROACHES  TO  COOPERATIVE 

ANSWERING 


JOSHI/WEBBER  ’82: 

•  Consider  user  beliefs  to  anticipate  user  expectations. 

•  Provide  extra  info  to  prevent  misconceptions. 

Q:  “Is  Sam  an  associate  professor?” 

System:  User  believes  most  associate  professors  have  tenure. 
System:  Sam  is  not  tenured. 

System:  Sam  is  an  associate  professor. 

A:  “Yes,  but  he  doesn’t  have  tenure.” 

KAPLAN  ’82: 

•  Detect  and  present  false  presuppositions 

Q:  “How  many  students  failed  CS  401  last  semester?” 

A:  ‘There  was  no  such  course.” 
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EARLY  APPROACHES  TO  COOPERATIVE 

ANSWERING 


HIRSHBERG  ’85:  Use  scalar  implicature  to  answer  yes/no 

questions 

MAYS  ’81:  Use  schema  to  correct  false  presuppositions 

McCOY  ’85:  Use  world  knowledge  to  correct  object  oriented 

misconceptions 
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EARLY  APPROACHES  TO  COOPERATIVE 

ANSWERING 


JANAS  ’81: 

•  Compute  subqueries  of  a  query  and  see  if  they  fail  or  not. 

•  Integrity  constraints  can  eliminate  some  subqueries. 

Q:  “Find  all  red  cars  in  the  database  owned  by  employees.” 

(Q  fails) 

SUBQUERY  1:  “Find  all  red  cars.” 

SUBQUERY  2:  “Find  all  cars  owned  by  employees.” 

IC;  “All  cars  are  owned  by  employees.” 

(SUBQUERY  1  fails) 
(SUBQUERY  2  true  by  IC) 
ANSWER:  “None.  There  are  no  red  cars  in  the  database.” 
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EARLY  APPROACHES  TO  COOPERATIVE 

ANSWERING 


LEHNERT  ’81: 

•  Classify  questions  into  conceptual  categories 

Q:  “How  did  John  take  the  exam?” 
SYSTEM:  Enablement  Question 
A:  “He  crammed  the  night  before  ” 
SYSTEM:  Instrument /Procedural  Question 
A:  “He  took  it  with  a  pen.” 
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ADDRESSING  USER  GOALS:  AI  APPROACHES 

POLLACK  ’85: 

•  Detect  how  question  fits  into  user  plan 

•  Offer  alternative  answer  that  facilitates  plan 

Q:  “Under  VI  editor,  how  can  I  delete  ctrl-Z?” 

SYSTEM:  user  typed  ctrl-Z  and  wants  to  undo  it 

A:  “ctrl-Z  has  stopped  the  current  process.  lype  ‘fg’  to  start  it 
again.” 


ALLEN  ’82:  Detect  user  goals  and  potential  obstacles 
WAHLSTER  ’83:  Detect  user  goals  and  anticipate  questions 
QUILICI/DYER/FLOWERS  ’88:  Correct  plan-oriented 
misconceptions. 

CARBERRY  ’88:  User  models  help  form  answers 
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ADDRESSING  USER  GOALS:  RDB  APPROACHES 

MOTRO  ’87: 

•  “Neighborhood  Information” 

•  Compute  ‘distance’  between  values  in  query  and  values  in  possible 
answers. 

•  Attributes  are  weighted  by  user  priority. 

Q:  “What  moderately  priced,  very  good,  Chinese  restaurant  is 
in  Chinatown  or  Westwood?” 

SYSTEM: 

restaurant(Jasmine_Gardens,Chinese,Chinatown,Mod,Gk)od). 
distance  =  (Weight  *  |Very-Gk)od  -  Good  |  ) 

SYSTEM: 

restaurant(Nippon,Japanese,Downtown,Expensive,VeryGd). 
distance  =  (W1  *  | Chinese  -  Japanese])  -I- 

(W2  *  [Downtown  -  Chinatown])  + 

(W3  *  [Moderate  -  Expensive]) 

A:  (All  restaurants  whose  tuples  are  ‘near’  the  query). 
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ADDRESSING  USER  GOALS:  RDB  APPROACHES 


CHU/LEE  ’90: 

•  Use  partitioned  generalization  hierarchy  over  DB  domain 

•  For  more  answers,  translate  query  into  new  partition 

•  Formalized  thru  relational  algebra 

Q:  “Which  flights  go  from  DCA  to  LAX  cost  less  than  $300?” 
SYSTEM:  flight  isa  mode-of-travel 
train  isa  mode-of=travel 
DCA  is-in-area  DC 
LAX  is-in-area  LA 

Q:  “Which  trains  go  from  a  train  station  in  DC  to  a  train  station 
in  LA?” 

A:  (list  of  flights  and  list  of  trains). 
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ADDRESSING  USER  GOALS:  DDE  APPROACHES 


CUPPENS/DEMOLOMBE  ’88: 

•  Entity  relationship  model  with  hierarchy 

•  Add  to  query  new  arguments  related  to  current  arguments 

•  Formally  defined  thru  logic 

Q:  “Which  flights  from  Paris  to  NY  leave  between  7  and  11  am.?” 

SYSTEM:  topic(departure-time,TIME). 

topic(arrival“time,TIME) . 

Q:  and  when  do  they  arrive?” 

A:  “PanAm  flight  101  leaves  at  7:30am  and  arrives  at  noon”. 
American  flight  733  leaves  at  11:00am  and  arrives  at  3:00pm” 
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TAILORING  ANSWERS  TO  USERS 

PARIS  ’88: 

•  Address  user’s  level  of  expertese  in  content  of  answer 

Q:  “What  is  a  telephone?” 

SYSTEM;  user  is  an  engineer 

A;  (technical  description  of  parts  and  how  they  work) 

SYSTEM;  user  is  an  S-y ear-old 

A;  (visual  description  of  object  and  what  it  does) 


1-13 


INTENSIONAL  ANSWERS 


IMIELINSKI 

•  INTENSIONAL  ANSWER  (lA)  IS  EQUIVALENT  TO  QUERY 

•  lA  =  SET  OF  EDB  FACTS  +  SET  OF  NEW  IDE  RULES 

Q:  Q(Z)  *«—  teach(siiiith,Z). 

“List  all  courses  that  Smith  can  teach.” 

IDB:  teach(X,Y)  teach(X,Z),  prerequisite(Y,Z). 

“X  can  leach  Y  if  X  can  teach  Z,  and  Y  is  a  prerequisite  of 
Z.” 

EDB:  teach(smith,math400). 

EDB :  prerequisite(math350,math400) .  prerequisite(math300,math350) 

lA:  “Math  400  and  any  prerequisite  courses.” 

Q(Z)  ^  prerequisite(Z,Y),Q(Y). 

Q(math400). 
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INTENSIONAL  ANSWERS:  OTHER  WORK 


•  CHOLVY/DEMOLOMBE 

•  PIROTTE/ROELANTS 
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INDUCTIVE  RESPONSES 


CORELLA,  SCHUM/MUNTZ: 

•  After  calculating  extension,  find  generalizations. 
DB  DOMAIN:  {  ann,  sue,  mary  } 


IDB: 

technician(ann).  technician(sue).  researcher(mary). 

employee(X)  researcher(X) 

employee(X)  technician(X) 

works_9-to_5(X)  secretary (X). 

works_9_to_5(X)  technician(X). 

QUERY:  works_9-to_5(X). 

ANSWER:  {ann,sue}/X 

INDUCTIVE  ANSWERS 

technician(X) 
employee(X)  +  X^  mary 
2/3  employees 
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APPENDIX  J 


APPENDIX  J 


+ 


+ 


CoBase: 

A  Cooperative  Distributed 

Database 


Wesley  W.  Chu 

Computer  Science  Department 
University  of  California,  Los  Angeles 


Supported  by  DARPA  contract 
N00174-91-C-0107 


Cooperative  Distributed  Database 

Systems 


The  next  generation  of  distributed  database 

« 

systems,  using  Inference  techniques  to  pro¬ 
vide: 

•  Fault  Tolerance  Capabilities 

•  Cooperative  Query  Answering 

Validate  concepts  with  prototype  cooperative 
distributed  DDBMS 


Cooperative  Query  Answering 


Motivation 

Conventional  query  answering 

•  provides  too  much  data 

•  may  obscure  real  meaning  of  data 

•  time  consuming 

•  needs  to  know  the  detailed  database  schema 

•  cannot  get  approximate  answer  if  full  data 
is  unavailable 

•  cannot  analyze  the  intent  of  the  user  and 
derive  relevant  information  if  the  exact  an¬ 
swer  is  not  available 

•  cannot  answer  conceptual  queries 


Cooperative  Query  Answering 
Derive  Summary  Answers 


Derive  Intensional  Answers 


Derive  Approximate  Answers 


Answer  Conceptual  Queries 


Examples  of  Cooperative  Query 

Answering 

Intensional  query  answering 

Q:  List  all  cars  with  air  bags. 

Mercedes  and  Ford  Taurus  LQs  built  after  1989. 


Approximate  query  answering 


Q:  List  all  flights  from  LA  to  New  York  de¬ 
parting  between  8  and  9  AM. 


Flight 

Departs 

From 

To 

UA  #2 

9:05 

LA 

New  York 

TWA  #10 

8:00 

LA 

Newark* 

helicopter  flight  (15  min,  $50)  and  shuttle  bus  (50  min, 
$20)  available  every  1/2  hour  from  Newark  to  New  York 
City. 


Examples  of  Cooperative  Query 

Answering 


Conceptual  query  answering: 


Q:  What  is  the  best  way  to  travel  by  air  from 
LA  to  New  York? 


Flight 

Dep 

From 

To 

Time 

Stops 

Cost 

UA  20 

10:00 

LA 

NY 

7 

Chii 

$325 

TWA  5 

9:00 

LA 

NY 

7 

Wash2 

$365 

TWA  1 

8:00 

LA 

Nwk^ 

5 

none 

$650 

UA  2 

9:05 

LA 

NY 

5 

none 

$725 

1  In  winter,  stopover  time  varies  with  weather  conditions. 

2  Recommended  for  winter  season. 

3  Surface  transportation  available  from  Newark  to  New 
York 


Neighborhood  Inference 


Approach 


•  Translate  a  query  on  missing  data  into  a 
related  query  on  available  data 


•  Provide  answers  with  different  degrees  of 
generality,  coverage  and  approximation 


Mechanisms 


Based  on  semantic  knowledge,  organize  data 
in  type  abstraction  hierarchy 

Provide  multi-level  knowledge  representation 

Support  inference  between  different  knowledge 
levels 

•  generalization 

•  speciaiization 

•  association 


+ 
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Type  Abstraction  Hierarchy 


Abstract  view  of  type  heirarchy 
Integrates: 

•  Subsumption  (is_a) 

•  Composition  (part-Of) 

•  abstraction 


Provides  multi-level  knowledge  representation 


Example  Abstraction  Hierarchy  II 


Multilevel  Abstraction  Hierarchy 


cheap 


COST 

reasonable 


expensive 


COST  ^ 
low  medium  high 


^LIGH^  CO^ 
low  medium  high 


^TRAINjCOST 
low  medmm  high; 


$100  $200  $275  $325  $350  $400  $475  $525 

^  Actual  Cost 


+ 


Type  Abstraction  Hierarchy 


Cross  Country 
Journeys 


Cross  Country 

Cross  Country 

Cross  Country 

Flights 

Trains 

Busses 

Delta 

Flights 


United  AT&SF  Amtrak  Greyhound 

Flights  Trains  Trains  Bus 


-f 
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Example  of  Multilevel  Type  Abstraction 

Hierarchy 


(LA,  Washington,  short,  expensive) 


JOURNEYS 


I 


j  FLIGHTS 

(LA,  Washington, 
morning,  afternoon, 

I  5,  medium)  > 


American  Flight  #753  \ 

LAX  to  Washington 
leaves  10  AM,  takes  5  hours 
costs  $450  y 


American  Flights 


Greyhound  Busses 


Generalization  and  Specialization 


Generalization 


SELECT  *  from  ccjourney 
WHERE  departure_area  =  "LA” 
AND  arrival_area  =  "Washington" 
AND  cost  =  "reasonable" 


generalization 


Cross  Country 
Journeys 


Cross  Country 
Flights 


SELECT  *  from  cc  flight 
WHERE  departure” area  =  "LA" 
AND  arrival_area  =  "Washington" 
AND  flight_cost  =  "low" 


generalization 


DELTA 

Flights 


SELECT  *  from  DELTA  flight 
WHERE 

departure_airport  =  "LAX" 

AND  arri^l  airport  =  "National" 
AND  fare  BETWEEN  <$250,  $300> 


Specialization 


Cross  Country 
Journeys 


SELECT  *  from  ccjourney 
WHERE  departure_area  =  ”LA” 
AND  arrival^aresTs  "Washington" 
AND  cost  =  "reasonable" 


specialization 


Cross  Country 
Trains 


SELECT  ♦  from  cc^train 
WHERE  departure  area  =  "LA" 

AND  arrival  areals  "Washington" 

AND  train^cost  WITHIN  {"high" /’medium"} 


specialization 


AT&SF 

Trains 


SELECT  *  from  Santa-Fe_traiii 
WHERE 

departure  station  WITHIN  {"LA",  "Long  Beach") 
AND  arrifal  station  WITHIN  {"DC  Union",  "Fairfax") 
AND  fare  BETWEEN  <$275,  $400> 


All  possible  Specializations 


SELECT  •  from  cc_flight 
WHERE  depanure_area  =  “LA“ 
AND  arrival.area  =  “Washmgion" 
AND  flight.cost  =  “low" 


Cross  Country 
Journeys 


SELECT  *  from  ccjourney 
WHERE  departure_area  =  "LA” 
AND  arrival__area  =  "Washington" 
AND  cost  =  "reasonable" 


SELECT  *  from  cc_irain  SELECT  *  from  cc_bus 

WHERE  departure.area  =  “LA"  WHERE  departure.aiea  =  “LA" 

AND  arrivai.area  =  “Washington"  AND  arrival.area  =  “Washington" 

AND  train.cost  WITHIN  ( "high" ."medium" )  AND  bus_cost  a  "high" 


Cross  Country 

Cross  Country 

Cross  Country 

Flights 

Trains 

Buses 

3 

6 

3 

3 

SELECT  *  from  DELTA.flight 
WHERE 


departure_airpon  WITHIN 
("LAX",  "Burbank",  "Long  Beach") 
AND  arTival_airport  WITHIN 
{"National",  "BWI",  “Dulles") 

AND  fare  BETWEEN  <S100,  S350> 


SELECT  •  from  Santa-Fe_irain 
WHERE 

depanure.stationWTTHlN  •  •  • 
•  (“LA".  “Long  Beach") 

AND  arrival  station  WITHIN 
(“DC  Union".  "Fairfax") 

AND  fare  BETWEEN  <$100,  S325> 


SELECT  *  from  Gieyhound.bus 
WHERE 


dq)anuie_stationWTrHIN 
{"LA  -  downtown",  "Hollywood") 
Af®  airival_station  WITHIN 
("DC  -  downtown",  "Rockville") 
AND  fare  BETWEEN  <$276.  $32S> 


DELTA 

lAmericani 

i  Amtrak  j 

AT&SF 

Greyhound 

Flights 

*  *  *  1  Flights  j 

j  Trains  i 

••••••••••••••••—•J 

Trains 

Bus 

■f 
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Relaxation  Mechanism 

Relaxable  Variables 


•  Types 

•  Attributes 


•  Attribute  Values 


4- 
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+  + 


Nearness  Measure 

Context  Dependent 

•  Time 


•  Space 


•  Concept 


Relaxation  Specifics 


+ 


+ 


Explicit  (by  user) 

•  relaxable  range  specified  by  C-SQL 

•  primitives  (relaxable  predicates) 

Implicit  (by  system) 

•  generalization  specialization 

•  based  on 

—  context 

—  type  abstraction  hierarchy 

—  abstract  domain  values 

Interactive 

•  through  user/system  interaction 


+ 
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C-SQL 

An  extended  Query  Language 


+ 
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+  + 

Primitives  (Predicates)  for  Cooperative 

Quety  Answering 

Context  Free 

•  approximate 

"9  AM 

•  inclusion 

between  (7  AM,  ll  AM) 

•  set  membership 

within  {‘LAX’, ‘Burbank’} 

Context  Sensitive  (nearness) 

•  Restaurant  near-to  ‘Redondo  Beach’ 

•  Airport  near-to  ‘LAX’ 

•  Chinese  Restaurant  nearest-to  ‘UCLA’ 

Relaxation  Order  may  be  specified 

•  relaxation-order  (food  style,  location) 

+ 
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Original  Query 


select  *  from  delta-flight 
where  dep-airport  =  “Newark” 
and  arr-airport  near-to  “Lax" 


+ 
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Relaxed  Query 


select  *  from  delta-flight 
where  dep-airport  =  “Newark” 
and  arr-airport  within 
{“Burbank",  “Long-Beach",  “Lax”} 


delta-flight 


dep_airport 

arr_alrport 

dep-time 

msm 

151 

NEWARK 

LAX 

845 

1319 

400 

367 

NEWARK 

BURBANK 

1000 

1428 

350 

29 

NEWARK 

LONG-BEACH 

935 

1422 

300 

+ 
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Nearer 


select  *  from  delta-flight 
where  dep-airport  =  “Newark” 
and  arr-airport  within 
{“Burbank”,  “Long-Beach”,  “Lax”} 


delta-flight 


dep.airport 

arr.airport 

dep-tjme 

KmnfiinB 

da 

151 

NEWARK 

LAX 

845 

1319 

400 

367 

NEWARK 

BURBANK 

1240 

1708 

350 

+ 


+ 


Nearer 

select  *  from  delta-flight 
where  dep-airport  =  “Newark" 
and  arr-airport  within 
{“Burbank",  “Long-Beach",  “Lax"} 


delta-flight 


dep_airport 

arr.airport 

dep_time 

arr_tlme 

fare 

151 

NEWARK 

LAX 

845 

1319 

400 

+ 


+ 


Original  Query 

select  *  from  restaurants 
where  type  =  “Chinese" 
and  location  near-to  “Lax” 


+ 
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Relaxed  Query 


select  *  from  restaurants 
where  type  =  “Chinese" 
and  location  within 
{  Westchester”  ,  “Inglewood”  ,  “Lax”  } 


restaurants 


name 

type 

location 

address 

MANDARIN  WOK 

Chinese 

Inglewood 

424,  Imperial  Hwy. 

Original  Query 


select  *  from  american-flight 
where  dep-airport  =  “National" 
and  arr-airport  near-to  “Lax” 
and  dep-time  between  (1000  *1100) 
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+ 


+ 


Relaxed  Query 

select  *  from  am  erica  n-f  light 
where  dep-airport  =  “National” 
and  arr-airport  within 
{“Lax",  “Burbank”,  “Long-Beach”} 
and  dep-time  between  (1000  1200) 


american-flight 


~FW 

dep.airport 

arr.airport 

dep_time 

arr_tjme 

fare 

305 

NATIONAL 

LAX 

1200 

1624 

385 

Patterns 


Specified  by  query  conditions  one  or  more 
attributes 

•  arrival  time  =  10  AM 

•  departure  airport  =  LAX 

Advantages 

•  Finer  granularity  than  types 

•  More  specific  knowledge  representation 

•  Complex  queries  can  be  derived  from  log¬ 
ical  operations  on  patterns 

•  Unified  interface  between  KB  and  DB 


+ 
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Association 


KB  IF  fliaht  destination  X  has  bad  weather  (heavy  fog.  icy  runway) 
THEN  flight  may  be  delayed  or  directed  to  nearby  airports 


coast-to-coast  (  LA,  Washinton,  morning,  afteimoon,  expensive) 


generalization 

Patterns  i — i  - 

□  U  □  d 

AA-flight(LAX,  Dulles,  10am,  6pm,  $500) 

□ 

□□ 

pattern  deduction 


Original  Query 


select  *  from  delta-flight 

where  dep-airport  =  “Lax” 

and  arr-airport  =  “O’ Hare” 

and  arr-time  between  (1700  1900) 


delta-flight 


R# 

dep-airport 

arr_airport 

dep.time 

arr_time 

fare 

1226 

LAX 

O’Hare 

820 

1825 

356 

Pattern:  arr-time  between  (1700  1900) 
Pattern  name;  rush-hour-arrival-time 


Associated  Subject:  Surface  Traffic  Conditions 

Information  derived  from  association: 

Extensive  traffic  delays  on  major  freeways 
between  airport  and  downtown. 


+  + 

Pattern:  dep-airport  =  “Lax” 

Pattern  name:  los-angeles— dep-airport 


Associated  Subject:  Weather 

Information  derived  from  association: 
Adverse  weather  conditions  over  LA  in 
winter  might  delay  flights. 


Pattern:  arr-airport  =  “O-Hare” 
Pattern  name:  Chicago-arr-airport 


Associated  Subject:  Weather 

Information  derived  from  association: 
Snowstorms  over  Chicago  in  winter  might 
divert  flights  to  other  airports. 
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Implementation 


Database  Management  System 

Sybase 

Knowledge  Representation  Package 

LOOM 
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+  + 


Cooperative  Query  Answering 
Architecture 


Query  Modification 


+ 
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Flow  Diagram 


SQL  Query 


Abstraction  Representations 


Graphical  Representation 


Los  Angeles 


Washington  D.C. 


region 


LAX 


Burbank  \ 

Long  Beach 


BWI 


National 


Dulles 


airport 


Logical  Representation 


Area _ Airport _ 

Los  Angeles  LAX,  Burbank,  Long  Beach 
Washington  Dulles,  Balt.,  National 
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Abstraction  Representations,  cntd. 

LOOM  Representation 

•  (defconcept  area  :is  :prinnitive) 

•  (defconcept  airport  :is  rprimitive) 

•  (defrelation  abstraction-of) 

•  (tellm  (area  los-angeies)) 

•  (tellm  (area  Washington)) 

•  (tellm  (airport  lax)) 

•  (tellm  (airport  national)) 

•  (tellm  (abstraction-of  los-angeles  lax)) 

•  (tellm  (abstraction-of  Washington  national)) 


Nearness  Representations 


Graphical 


Japanese 


Matrix 


Japanese 

Chinese 

Japanese 

u 

Indian 

Chinese 

1 

0 

French 

Indian 

4 

4 

0 

Spanish 

French 

9 

9 

7 

0 

- 9 - 

b 

3 

0 

+ 
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Nearness  Measures  are  Context  Sensitive 


attribute 

unit  of  semantic  distance 

departure  time 

one  hour 

flight  time 

ten  minutes 

Airport 

100  miles 

Restaurant 

5  miles 

Summary 


Cooperative  Query  Ansvy/ering  provides 
approximate  answers 
conceptuai  answers 
answers  to  imprecise  queries 

Type  Abstraction  Hierarchy 

framework  for  deriving  answers  to 
cooperative  queries 

CSQL 

provides  explicit  and  implicit  relaxation 
facilities  for  both  types  and  attributes 

Relaxation  based  on 
query  co^ntext 
nearness  measures 
relaxation  order 
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Continuing  Work 


Nearness  Measures 
Association  of  Subjects 
Knowledge  maintenance 
Scalability 
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APPENDIX  K 


MONOTONE  DATABASES 


(Imprecise  Query  Processing) 


J.  W.  S.  Liu,  S.  Vrbsky  and  X.  Song 

Real-Time  Systems  Laboratory 
Department  of  Computer  Science 
University  of  Illinois,  Urbana,  IL 


Imprecise  computation  technique  and 
monotonicity 

Approximate  relational  algebra  and  an 
approximation  semantics  for  set-valued 
queries 

Query  processing  strategy  for  returning 
monotonically  improving  answers 
Future  work 


Providing  Approximate  Answers 


An  approach  to  providing  flexibility  in 
scheduling  and  concurrency  control 


C 


Exact  Answers  Vs  Approximate  Answers 


Related  Work  on 
Approximations  in  Databases 

Incomplete  databases  —  Tzvieli,  Lispki,  Imielinski, 
Winslett 

Vague  and  generalized  queries  —  Motro,  Chauhuri 
Disjunctive  data  —  Liu  and  Sunderraman 

Monotone  query  processing 

o  set-valued  results,  missing  data  in  mle-based 
systems  —  Buneman,  Davidson  and  Watters 
o  statistical  databases  —  Ozsoyoglu  and  Ozsoyoglu 


Where  can  flight  United  941  land? 


The  Exact  Relation 


Location 

Appro.  Dist 

Chi-O’Hare,  IL 

350 

Manchester,  NH 

250 

Peoria,  IL 

400 

Syracuse,  NY 

100 

An  Approximate  Relation 


Location 

Appro.  Dist. 

Chi-O’Hare,  IL 

350 

Peoria,  IL 

400 

Paris,  EL 

380 

Hell,  MI 

330 

Manchester,  NH 

250 

Berlin,  MA 

300 

Syracuse,  NY 

100 

Certain  tuples 


Possible  tuples 
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What  is  an  approximate  answer? 


•  Observations: 

—  An  answer  to  a  relational  query  is  a  relation  R , 
that  is,  a  set  of  tuples. 

—  A  subset  of  R  approximates  the  set. 

—  A  superset  of  R  approximates  the  set. 

•  A  semantics  of  approximation  for  set-valued  queries: 

—  An  approximation  of  R  has  two  parts: 

0  The  set  C  of  certain  tuples  is  a  subset  of  R . 

0  The  set  P  contains  possible  tuples  in  R . 
o  The  union  of  C  and  P  is  a  superset  of  R . 

—  The  worst  approximation  of  R  is  one  which  contains 
no  certain  tuples. 

—  The  best  approximation  of  R  is  itself. 
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Location 

Chi-O’Hare,  IL 

350 

Peoria,  IL 

400 

Syracuse,  NY 

100 

Paris,  IL 

380 

Hell,  MI 

330 

Manchester,  NH 

250 

Berlin,  MA 

300 

Location 

Chi-O’Hare,  IL 

300 

Peoria,  IL 

400 

Paris,  IL 

380 

Manchester,  NH 

250 

Berlin,  MA 

300 

Syracuse,  NY 

100 

K- 
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iiliiil; 

Approximate  Relational  Algebra  Model 

•  Approximate  relation  R : 

—  subset  and  superset  of  a  standard  relation  S 

—  certain  tuples  C  and  possible  tuples  P 
—  R  approximates  S  if: 

CC^andCuP^^ 

•  Partial  order  over  set  of  approximate  relations: 

R.>  R.  if: 

*  —  ; 

C.  D  C.,  P.  C  P.,  C,  -  C,  C  P,  -  P, 

1  —  j’*  —  j  —  j  * 

•  Partial  order  set  is  a  lattice: 

(0,  v)  -  worst  approximation 
(Sj  0)  -  best  approximation 
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Approximate  Relational  Algebra  Operations 


Union,  set  difference,  select,  project,  cartesian  product 
•  Every  approximate  operation  is  monotone 


Approximate  Operation 

Ct 

Pt 

Union:  Rj>  ^  R^\J  Rj 

(7y  -  (7,  U  C, 

Pr.{P,VP^-CT 

Difference:  Rj.  ^  R^—  R^ 

*  Cj  -  R^ 

Pr^{P,-R^U{P.nR^) 

Select:  R^  ** 

Project:  Rj.  *  *”^^1 

Cart.  Prod:  ^  X  R^ 

Cji  *  Cj  X  Cj 

P j.  *  (ffj  X  iZj)  —  Cj. 
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Monotone  Query  Processing 
At  Each  Node 


•  Approximate  operations 

•  Approximate  relations  initialized  to  (0,  v) 

•  Monotonically  improving  answers 

—  tuples  migrate  from  possible  to  certain  part 

—  improved  values  propagate  upward  with 

each  update 
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APPROXIMATE 

{A  Prototype  Monotone  Query  Processor) 


•  Supports  processing  of  relational  algebra  queries  on 
relational  database  systems 

•  Uses  an  objected-oriented  approach  to  implementation: 

—  relies  on  an  object-oriented  view  for  semantic 
support  in  identification  of  initial  approximations 

—  avoids  operations  on  possible  tuples 

—  provides  lazy  evaluation  of  possible  classes  upon 
user  interrupt  or  faults 

•  Improves  data  availability  and  query  processing  time 
predictability 
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Approximate  Query  Processing 
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Object-Oriented  Monotone  Query  Processor 


•  How  to  implement  the  monotone  query  processor? 

—  initial  approximations 

good  approximation 

—  approximate  relational  algebra  operations 

operations  on  possible  tuples 


•  Object-oriented  monotone  query  processor 

—  semantic  support  for  efficient  implementation 

—  external  view  is  relational 

—  underlying  relational  architecture  unchanged 
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Approximate  Class 


C 


\ 

\ 

\ 


^  relational  algebra 
operator  or  read  request 


Possible  Classes 


Possible 

classes 


{possible  tuples} 
P 
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Approximate  Object 


•  Approximate  object  R  =  (C,  P) 

•  Describes  a  set  of  approximate  relations  of  S 

R.  >  R.  if: 

t  —  j 

C.  DC.,  P.CP., 

C‘  —  C  G  instances  of  P.  —  P. 

t  j  —  j  * 

If  R .  >  R .  then  R  >R. 

t  —  j  »  —  j 
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Approximate  Class  Methods 


Approximate  class  contains  four  methods: 
new 

generalization 

specialization 

migration 

Methods  are  monotone 


Methods  invoked  during  query  processing 


A.pproxinifl>t<e  Rol&tionsl  Algebrsi  Opcr&tions 


•  Redefined  using  generalization  method  (genl) 


Approxiinat*  Op«rmtion 

Pf 

Union:  Rji  Rj  U  R) 

Cf  *■  C|  U  U| 

Pr-(P,UP^-l«l(C,) 

Difftrtnct:  Rf  «  Rj  —  R, 

c,  -  c,  -  c, 

P,-(Pj-I«il«7,)-P^ 

u(P,ngoai((7j)nPi) 

Pr  Pi 

Project:  Rf  -  'jo®! 

Cj.  •  Cj 

Pr  “  *•«  Pi 

^»rt.  Prod:  R^  »  R^  X  R, 

Cy  ■  Cj  X  Cj 

i 

Pr  *  ((*««l{^i)  X  Pi)  U  (genl(C7i)  X  Pj) 

U  (f«iU(C7^  X  Pi)  U  (f*nl(C,)  X  Pj): 
-S«l(Cr) 

Summary 


•  Object-oriented  query  processor: 

—  identifies  initial  approximation 

—  avoids  operations  on  possible  tuples 

•  F  uture  work: 

—  implement  object-oriented  query  processor 

examine  overhead  due  to  class  hierarchy 

—  class  lattice  with  multiple  inheritance 

—  single-valued  queries 

—  general  approximate  model  using  distance 
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Single-Valued  Query  Approximations 


Min-Max  Queries 

"What  is  the  longest  runway  at  the  Rome  airport?" 

shortest  ^  ^  longest 

length  V  length 

start  at  the  longest 


Value-of  Queries 

"What  isihe  direction  of  runway  #20?" 
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Semantic  Support 

Distance  function: 

determines  which  class  is  closest 


"maroon" 


Muiti-Dimensionai  (Muitiple  Attributes): 

"What  is  the  airport  with  the  longest  runway 
in  the  smallest  town?" 

longest 
runway 


smallest 

town 


Multiple  Classes  -  different  strategies 
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Yes/No  Queries 


Approximate  Answers 

No  -  30%,  No  -  80% 

Comparison  Queries 

"Is  the  runway  at  Paris  at  least  8000  feet?" 


start  at  8000  ft. 


Process  values  8000, 9000, 10000 , ... 
or  7000,  6000,  5000, ... 

Existential  Queries 

"Is  there  an  airport  in  Vienna,  IL  ?" 
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Work  in  progress 

—  Monotone  query  processing  when  the 
database  contains  complete  information 
and  queries  are  precise. 

Future  work 

—  Monotone  query  processing  in  the 
presence  of  imprecise  or  incomplete 
information 

—  Imprecise  updates 
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Queries  and  Assertions 
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Time  Tokens 
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Path  Consistency 


user’s  response 


Temporal  Conditions  and  Caching 
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Temporal  Indexing 
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of  the  size  of  the  database. 
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Highly  Programmable  Architectures  (Phase  1)  Objectives 

1.  Assess  status  of  data-flow  research,  U.S.  and  foreign 

2.  Test  data-flow  architecture  for  non-numeric  application 

3.  Define  strategy  for  extrapolation  to  larger  non-numeric  applications 


Highly  Programmable  Architectures  (Phase  1)  Results 

1.  Micro  instruction  sets  appear  sufficient  for 
implementing  some  knowledge-based  applications 

2.  Macro  data-flow  actors  should  provide  higher  efficiency 
than  micro  actors 

3.  High-level  data-flow  languages  appear  to  have 
expressive  power  required  for  natural  language  applications 

4.  Data-flow  principles  can  deliver  runtime  parallelism  to 
natural  language  applications  with  little  compiler  effort. 

5.  Further  study  should  be  conducted  on  larger  examples  of 
natural  language  and  knowledge-based  applications. 


Programming  Parallel  Architectures  (Phase  2)  Objectives 

1.  Specify  requirements,  design,  and  implement 
application-development  environment 

for  multiprocessor  architectures, 
using  data-flow  principles 

2.  Test  environment  with  selected  natural  language 
and  knowledge-based  applications 

_  Language  Systems,  Inc. 
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Simple  Grammar  for  "The  parsing  algorithm  is  specified" 


NPVP  ->S 
Det  NG  ->  NP 
(Adj)N  ->NG 
Art  ->  Det 
Aux  VB  ->  VP 


Grammar  Symbols 

Adj  adjective 
Art  article 
Aux  auxiliary 
Det  determiner 
N  noun 
NG  noun  group 
NP  noun  phrase 
-  S  sentence 
VB  verb 
VP  verb  phrase 
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Chart  of  parses  for  "The  parsing  algorithm  is  specifled" 


Row 


Index 

(1) 

(2) 

(3) 

(4) 

(5) 

1. 

1 

1 

Det 

1 

0 

2. 

1 

2 

Adj 

2 

0 

3. 

1 

2 

VB 

3 

0 

4. 

1 

2 

N 

4 

0 

5. 

1 

3 

N 

5 

0 

6. 

1 

4 

Aux 

6 

0 

7. 

1 

5 

Adj 

7 

0 

8. 

1 

5 

VB 

8 

0 

9. 

2 

1 

NG 

1 

4 

10. 

2 

2 

NG 

2 

5 

11. 

2 

4 

VP 

6 

8 

12. 

3 

1 

NP 

1 

10 

13. 

5 

1 

S 

12 

11 

Resulting  phrases  (for  each  index) 

1.  The 

2.  parsing 

3.  parsing 

4.  parsing 

5.  algorithm 

6.  is 

7.  specified 

8.  specified 

9.  the  parsing 

10.  parsing  algorithm 

11.  is  specified 

12.  the  parsing  algorithm 

13.  the  parsing  algorithm  is  specified 
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Parse  tree  for  "The  parsing  algorithm  is  specified" 
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Proper  synchronization  of  many 
Processing  Units 


Programmability 

Large  number  of  processing  Elements 
Insure  deterministic  &  safe  termination 
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Must  find:  efficient,  architecture- 
transparent  programming  paradigms 
for  large-scale  multiprocessors 


Reliable  Operation 
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*  Reliability  estimation  (execution  of  a 
given  program) 


Performance  Evaluation 
And  Benchmarking 
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Larger  benchmarks 


Alternative  Models 
of  Computation 
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Self-organizing  features 
Robustness 

Simple  Individual  Evaluation  Elements 


Data-flow  Principles  of  Execution 


Interpretation  Models 


Processing  on  Macro  Data-flow 
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N  L  Syntactic  Processing 
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R3:  NP  art  &  NG  •  R7:  NG  adj  &  NG 
R4;  VP  -»  verb  &  PP 


Data-flow  Grammar  Graph 


^  sSI  i 

Sclllcf 

f  t  t  t  t  T  t 
«  H  ^  S:  i  i 


CO  o 

M  E 

=  <0 
0)  ” 

“O  <1> 

•-  JZ 

0)  ♦- 

15£ 

W  O 

o 

«  X 

^  ®  ® 
_  "O  "O 

<  (0  u 


■»-  c 

0) 

■Q  5 

o 

C  A 

.2  (0 

l| 

®  «  0) 

g  is 

S  o  o 

Q  o  (0 


CO  o 

o  E 
(0  > 
LU  (0 


M-18 


corresponds  to  a  rule. 


Data-flow  Actor 


Position 


Execution  of  an  Actor 


beginning  end 


Macro  Actors 


If  match,  produce  a  new  token. 
If  no  match,  store  in  the  pool. 
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©J 


[airplane^yS] 


Example: 

*'The  blue  airplane  flew  over  Kampala" 

1.  "The"* art  ->  R3:left-input  ->  R3:left-pool 

2.  "blue"«adj  ->  R7:left-input  ->  R7:left-pool 

j  3.  "airplane"  =  n  ->  R6:input 
L  ->  R2:input 


[The, 1,1] 


©' 


42^ 
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[airplaDe,33]y^^ 


.M 


R1 
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[blueA^ 


[airplane^] 


4.  R6:output  ->  R7:right-jnput 

->  R3;righMnput 

5.  R2:output  ->  R5:rlght-input 

->  R1  .-right-input 


m  [airplaiiey3»3] 


[airplane^^] 


[airplane^^] 


Language  Systems,  Inc 


R6 


6.  R7:output  ->  R3:right-input 
->  R7:right-input 
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R6 


7.  R3:output  ->  R5:right-input 
->  R1  :left-input 
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begin 

for  all  nodes  x,y,z  in  parallel  do 

if  (x,y  belong  to  M  and  x,y  |--  z)  then  add  z  to  M 
check  if  (S,0,n)  belongs  to  M 


Implications  (1) 
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guage  applications. 


Implications  (2) 
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in  the  conventional  (micro)-instruction  sets  of  the  data¬ 
flow  model.  (This  has  been  tested  on  a  small  phrase- 
structure  grammar  for  English). 


Directions  for  Future  Research  (1) 
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Directions  for  Future  Research  (2) 
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ceptionally  useful  in  the  design,  implementation  and 
test  of  data-flow  graphs  for  natural  language  process¬ 
ing  and  knowledge-based  systems. 
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OF 

ROME  LABORATORY 

Rome  Laboratory  plans  and  executes  an  interdisciplinary  program  in  re¬ 
search,  development,  test,  and  technology  transition  in  support  of  Air 
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Force  Command,  Control,  Communications  and  Intelligence  (C  l)  activities 
for  all  Air  Force  platforms.  It  also  executes  selected  acquisition  programs 
in  several  areas  of  expertise.  Technical  and  engineering  support  within 
areas  of  competence  is  provided  to  ESD  Program  Of f ices  (POs)  and  other 
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